Journal of Educational 
Psychology 


Raymond G. Kuhlen, Editor
Syracuse University 


Advisory Editors

John B. Carroll, Harvard University
Lee J. Cronbach, University of Illinois
Francis J. DiVesta, Syracuse University
Chester W. Harris, University of Wisconsin
Ernest R. Hilgard, Stanford University
Nicholas Hobbs, George Peabody College for Teachers
Irving Lorge, Teachers College, Columbia University
Dorothea McCarthy, Fordham University
Carson McGuire, University of Texas
Paul H. Mussen, University of California, Berkeley
T. Ernest Newland, University of Illinois
H. H. Remmers, Purdue University
Julian C. Stanley, University of Wisconsin
J. B. Stroud, State University of Iowa
Donald E. Super, Teachers College, Columbia University
George G. Thompson, Ohio State University



VOLUME 52, 1961 


Arthur C. Hoffman, Production Manager
Helen Orr, Promotion Manager

PUBLISHED BIMONTHLY BY 
THE AMERICAN PSYCHOLOGICAL ASSOCIATION, INC. 
1333 SIXTEENTH STREET N.W. 


WASHINGTON 6, D. C. 
Copyright 1961, by the American Psychological Association, Inc. 


CONTENTS OF VOLUME 52 


Aiken, Lewis R., Jr., and Dreger, Ralph Mason. The Effect of Attitudes on Performance in Mathematics ... 19
Amidon, Edmund, and Flanders, Ned A. The Effects of Direct and Indirect Teacher Influence on Dependent-Prone Students Learning Geometry ... 286
Astin, Alexander W. A Re-examination of College Productivity ... 173
Astin, Alexander W., and Holland, John L. The Environmental Assessment Technique: A Way to Measure College Environments ... 308
Ausubel, David P., and Fitzgerald, Donald. The Role of Discriminability in Meaningful Verbal Learning and Retention ... 266


Bendig, A. W. See Meyer, William J.
Blum, Stuart H. The Desire for Security: An Element in the Vocational Choice of College Men ... 317
Bonkowski, Robert J. See Pimsleur, Paul.
Bostrom, Robert N., Vlandis, John W., and Rosenbaum, Milton E. Grades as Reinforcing Contingencies and Attitude Change ... 112
Bradway, Katherine P., and Robinson, Nancy M. Significant IQ Changes in Twenty-Five Years: A Follow-Up ... 74
Campbell, Robert E., and Horrocks, John E. A Note on Relationships between Student and Parent Minnesota Teacher Attitude Inventory Scores ... 199
Carter, Lamore J. See Stroud, James B.
Coulson, John E. See Silberman, Harry F.
Crook, Robert. See Taylor, Marvin.
Damrin, Dora E. See Runkel, Philip J.
Dreger, Ralph Mason. See Aiken, Lewis R., Jr.
Dropkin, Stanley. See Taylor, Marvin.
Dyer, Dorothy W., and Harcum, E. Rae. Visual Perception of Binary Patterns by Preschool Children and by School Children ... 161
Estavan, Donald. See Silberman, Harry F.
Fitzgerald, Donald. See Ausubel, David P.
Flanders, Ned A. See Amidon, Edmund.
Glasgow, George M. The Effects of Manner of Speech on Appreciation of Spoken Literature ... 322
Gottlieb, David. American Graduate Students: Some Characteristics of Aspiring Teachers and Researchers ... 236


Categorization and Related Verbalizations in Deaf and Hearing Adolescents...... 188 


Kazmier, Leonard J. Cross-Validation Groups, Extreme Groups, and the Prediction of Academic Achievement ... 195
Klausmeier, Herbert J., and Loughlin, Leo J. Behaviors during Problem Solving among Children of Low, Average, and High Intelligence ... 148
Koppitz, Elizabeth M., Mardis, Verdena, and Stephens, Thomas. A Note on Screening School Beginners with the Bender Gestalt Test ... 80


Kounin, Jacob S., and Gump, Paul V. The Comparative Influence of Punitive and Nonpunitive Teachers upon Children's Concepts of School Misconduct ... 44
Krulee, Gilbert K. See Nadler, Eugene B.
Lockard, Joan, and Sidowski, Joseph B. Learning in Fourth and Sixth Graders as a Function of Sensory Mode of Stimulus Presentation and Overt or Covert Practice ... 262
Loughlin, Leo J. See Klausmeier, Herbert J.
McFee, Anne. The Relation of Students' Needs to Their Perceptions of a College Environment ... 25
McGuire, Carson. Sex Role and Community Variability in Test Performances ... 61
McGuire, Carson. See Veldman, Donald J.
Mardis, Verdena. See Koppitz, Elizabeth M.
Medinnus, Gene R. The Relation between Several Parent Measures and the Child's Early Adjustment to School ... 153
Michael, James. See Kates, Solis L.
Miner, John B. On the Use of a Short Vocabulary Test to Measure General Intelligence ... 157
Mitchell, James V., Jr. An Analysis of the Factorial Dimension of the Achievement Motivation Construct ... 179
Muehl, Siegmar. The Effects of Visual Discrimination Pretraining with Word and
Nadler, Eugene B., and Krulee, Gilbert K. Personality Factors among Science and Technology Freshmen ... 223
Peck, Robert F. See Veldman, Donald J.
Piaget, Jean. The Genetic Approach to the Psychology of Thought ... 275
Pimsleur, Paul, and Bonkowski, Robert J. Transfer of Verbal Material across Sense Modalities ... 104
Reese, Hayne W. Manifest Anxiety and Achievement Test Performance ... 132
Robinson, Nancy M. See Bradway, Katherine P.
Rosenbaum, Milton E. See Bostrom, Robert N.
Rosenfeld, Howard, and Zander, Alvin. The Influence of Teachers on Aspirations of Students ... 1
Runkel, Philip J., and Damrin, Dora E. Effects of Training and Anxiety upon Teachers' Preferences for Information about Students ... 254
Runquist, Willard N., and Hutt, Valorie H. Verbal Concept Learning in High School Students with Pictorial and Verbal Representation of Stimuli ... 108
Ryans, David G. Inventory Estimated Teacher Characteristics as Covariants of Observer Assessed Pupil Behavior ... 91
Ryans, David G. Some Relationships between Pupil Behavior and Certain Teacher Characteristics
Sarason, Irwin G. Test Anxiety and the Intellectual Performance of College Students
Sechrest, Lee. See Shore, Eugene.


Sidowski, Joseph B. See Lockard, Joan.
Silberman, Harry F., Melaragno, Ralph J., Coulson, John E., and Estavan, Donald. Fixed Sequence versus Branching Autoinstructional Methods ... 166
Sjah, Anwar. See Thomas, R. Murray.
Stephens, Thomas. See Koppitz, Elizabeth M.
Stroud, James B., and Carter, Lamore J. Inhibition Phenomena in Fast and Slow Learners ... 30
Stroud, James B. See Wolf, Willavene.
Super, Donald E. Consistency and Wisdom of Vocational Preference as Indices of Vocational Maturity in the Ninth Grade ... 35
Taylor, Marvin, Crook, Robert, and Dropkin, Stanley. Assessing Emerging Leadership Behavior in Small Discussion Groups ... 12
Thomas, R. Murray, and Sjah, Anwar. The Draw-A-Man Test in Indonesia ... 232
Torrance, E. Paul, and Harmon, Judson A. Effects of Memory, Evaluative, and Creative Reading Sets on Test Performance ... 207
Veldman, Donald J., Peck, Robert F., and McGuire, Carson. Measuring the Value Systems of Education Professors ... 330
Vlandis, John W. See Bostrom, Robert N.
Walsh, Terrence M. See Kates, Solis L.
Washburn, Wilbur C. Patterns of Self-Conceptualization in High School and College Students
Webster, Staten W. The Influence of Interracial Contact on Social Acceptance in a Newly Integrated School ... 292
Wolf, Willavene, and Stroud, James B. The Contribution of Response in Mental Measurement ... 249
Zander, Alvin. See Rosenfeld, Howard.


JOURNAL OF 
EDUCATIONAL PSYCHOLOGY 


Volume 52 


February 1961 


Number 1 


THE INFLUENCE OF TEACHERS ON ASPIRATIONS
OF STUDENTS¹


HOWARD ROSENFELD AND ALVIN ZANDER


University of Michigan 


Early studies of goal setting have 
shown that a person usually selects a 
level of aspiration which represents a 
mild challenge for him, given that he 
has knowledge of his past performances 
(Lewin, Dembo, Festinger, & Sears, 
1944). In general, he attempts to reach 
the most rewarding goal that he feels 
he can reasonably attain. It is also 
known that aspirations are often in-
fluenced by sources external to the
persons who set them (Festinger,
1942). Little evidence is avail-
able concerning the nature of this in-
fluence process.

The interest of this study in the 
effects of teachers on a student’s as-
pirations was stimulated by the above
problem and by the awareness that
appropriate goal setting by a student
is an important practical issue. A
salient dimension on which a pupil
can hardly avoid setting a level of as-
piration, and can hardly avoid being
influenced by his teacher when doing
so, concerns his grades in school. We
define a student’s level of aspiration as 
that level of achievement, indicated 
by a grade, which a pupil realistically 
expects to attain in a given course. 

For the purposes of this investiga-
tion, we assume that teachers attempt
to influence pupils to work up to their
capacities, and that these pressures


¹ The research reported herein was per-
formed pursuant to a contract with the
Office of Education, United States Depart-
ment of Health, Education, and Welfare.


are guided by professional norms that 
students should not be expected to 
work beyond or below their capacities. 
We assume that any given student is 
reasonably confident he knows the 
level of his best possible performance 
(his capacity) and that he believes his 
teacher knows his capacity equally 
well. We further assume that students 
are aware of the pressures from 
teachers and that students believe 
they should not be pressed to aspire 
either beyond or below their capacities. 

The attempts of teachers to influence 
students are usually based upon as- 
sumptions as to what will motivate 
pupils to accept these inductions, such 
as rewards, punishments, provision of 
relevant information, and so on. 
French and Raven (1959) have pro- 
posed five separate bases of social 
power whose effectiveness depends 
upon the degree that they stimulate 
forces in the recipient of the influence 
attempts to act in accord with these 
inductions, minus the degree they 
generate forces in the recipient to re- 
sist these inductions. A summary of 
studies on the consequences of these 
different forms of social power is pro- 
vided by French and Raven (1959) and 
by Cartwright (1959). The hypotheses 
considered in the present investigation
are in large part suggested by French
and Raven. The derivations of these
hypotheses are described by those 
authors and will not be discussed here. 

A secondary concern of this study is
in the student’s perceptions of the
valence and the probability of per- 
forming up to his capacity, and how 
these variables intervene between in- 
fluence attempts by teachers and the 
setting of aspirations by students. This 
interest in the consequences of per- 
ceived valence and probability stems 
from assumptions in Lewin’s theory of 
aspiration setting that the level of 
aspiration is a function of the positive 
valence of succeeding, the negative 
valence of failing, and the probability 
of succeeding or failing. Probability is 
measured here in terms of the per- 
ceived difficulty of achieving a capac- 
ity performance. 
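
The function described above can be illustrated with a small sketch. It assumes one common reading of the resultant-valence idea: each candidate grade has a weighted valence equal to the probability of success times the valence of success, minus the probability of failure times the (negative) valence of failure, and the grade with the highest resultant is chosen. The `GoalLevel` structure, the function names, and all numeric values are hypothetical and are not data from this study.

```python
from dataclasses import dataclass


@dataclass
class GoalLevel:
    grade: str              # candidate grade the student might aspire to
    p_success: float        # perceived probability of attaining it (hypothetical)
    valence_success: float  # attractiveness of succeeding (hypothetical units)
    valence_failure: float  # unattractiveness of failing, as a non-negative size


def resultant_valence(level: GoalLevel) -> float:
    """Weighted valence of success minus weighted valence of failure."""
    p_failure = 1.0 - level.p_success
    return level.p_success * level.valence_success - p_failure * level.valence_failure


def choose_aspiration(levels: list[GoalLevel]) -> GoalLevel:
    """Pick the candidate level with the largest resultant valence."""
    return max(levels, key=resultant_valence)


# Hypothetical student: easy goals are safe but unattractive, hard goals are
# attractive but unlikely, so a moderately challenging goal wins.
levels = [
    GoalLevel("C", p_success=0.9, valence_success=2.0, valence_failure=6.0),
    GoalLevel("B", p_success=0.6, valence_success=6.0, valence_failure=3.0),
    GoalLevel("A", p_success=0.2, valence_success=10.0, valence_failure=1.0),
]
chosen = choose_aspiration(levels)
print(chosen.grade, round(resultant_valence(chosen), 2))
```

With these illustrative numbers the moderately difficult grade is selected, which is the "mild challenge" pattern described in the introduction.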

Finally, we consider the effects of dif- 
ferent forms of social power upon 
attitudes toward teachers and the sub-
ject matter of the course. Hypotheses
tested here were, for the most part, 
proposed by French and Raven. 

The present theoretical orientation
may be summarized in the following
model, the terms of which are ex-
plained below.

[Model: form of influence → decision making by student (valence of capacity; difficulty of capacity) → decision of student (congruence between student’s aspirations and capacity)]

By form of influence we refer to the
bases of social power proposed by
French and Raven: reward, coercion,
expert, legitimate, and referent. Each
of these is defined in the presentation
of the results. Valence of capacity is
the degree that a student perceives the
attainment of his capacity grade as an
attractive goal. Difficulty refers to his
perception of the probability that the
goal is attainable. It is assumed that
the degree of difficulty perceived by a
student is determined not only by his
personal ability, but also by external
barriers such as the competence of
teachers and the sufficiency of time and
help in doing assignments. Degree of
congruence between aspired grade and
capacity grade is the distance between
the aspired grade and the capacity
grade, divided by 1.


METHOD


Due to the exploratory nature of this re-
search and the large number of concepts in-
volved, a written questionnaire was selected 
as the basic instrument. Although it is diffi- 
cult to specify with confidence the direction 
of causality in our correlational results, hy- 
potheses and empirical evidence developed 
in other settings can be used in interpreting 
the most probable direction of causality. 
Conclusions will be stated in terms of hy- 
potheses deemed worthy of study under 
more controlled conditions. 

The relevant concepts were measured
with questions utilizing Likert-type scales.
Preliminary versions of this questionnaire
were developed and revised on the basis of
intensive interviews with students. The
questionnaire focused on the aspirations of
students in mathematics classes and on the
relationship of students to teachers of math-
ematics. Mathematics was selected since
this course is required of all respondents and
because there should be little ambiguity in
students’ minds over the nature of a good
performance compared to courses using
more subjective criteria for evaluation of
progress. The questions were repeated
for [...] courses in order to examine the
effects of a different course content and
of the greater social emphasis
upon achievement in mathematics which
characterizes contemporary society.

Four hundred male students, 100 from
each of four junior high schools, comprised
the sample. Tenth graders were selected be-
cause they include the oldest group with a
wide distribution of ability. The four schools
were chosen to provide a wide range of socio-
economic status and ability of students. A
comparison of results for the predicted rela-
tionships within each of the four schools re-
vealed that no important differences exist 
among the schools insofar as the present 
data are concerned.

Respondents in each school filled out the
questionnaire during a 1.5-hour period while 
teachers were absent from the testing room. 
The administration of the questionnaire uni- 
formly occurred shortly after students had 
received their grades for the fourth of six 
marking periods. The recent knowledge of 
their grades provided the students with sta- 
ble evidence of their present level of per- 
formance while permitting the possibility of 
future changes in the grades and in the 
students’ aspirations for grades. 


Validation of Assumptions 


The interpretation of results in this study 
rests upon assumptions stated earlier about 
teachers’ intentions and students’ percep- 
tions of teachers’ acts. The results, on the 
whole, support the reasonableness of these 
assumptions. 

It was assumed that students believe 
their capacity is known by teachers. Evi- 
dence suggesting that this assumption is 
warranted for the purposes of this study is 
provided by the pupils’ answers to the 
query: “Do you think your teacher is a good
judge of your ability?” In the responses 67%
of the students described their teachers as
“quite good” or “very good” judges of their
abilities. It was also assumed that students 
are aware of their own capacities and that 
they support the norm that they should not 
be expected to perform at levels beyond 
capacity. Support for these assumptions is 
found in responses to the question: “How
reasonable is your teacher in how well he ex-
pects you to do?” Seventy-nine percent of
the students replied that their teacher “ex-
pects about the right amount from me,”
14% felt that the teacher required “too
much,” and 7% answered “too little.” These
percentages also provide indirect support 
for the assumption that teachers are guided 
by professional norms holding that students 
should not be expected to work beyond or 
below capacity. 

A further assumption was that teachers 
expect pupils to work up to their capacity 
level. Over 80% in each of the schools per- 
ceived their teacher as expecting they could 
perform at the level of their capacity, while
less than 1% said that they had no idea what
their teacher expected of them. Most stu-
dents, however, did not view these as strong
demands. In response to the question: “Has
your math teacher ‘pushed’ you to work to-
ward this [capacity] grade?” 55% said that
they felt “little or no pressure” while only
14% said that they felt “quite a lot” of pres-
sure. 

The aspirations of students may be 
strongly determined by their confidence in 
themselves. We felt it was important, there- 
fore, to determine what effect this personal- 
ity characteristic might have upon students’ 
attitudes toward influence attempts by 
teachers as well as students’ aspirations and
achievements. A standardized measurement 
of test-anxiety prepared by Mandler and 
Sarason (1952) was administered to all sub- 
jects. High scores on this measure indicate 
a high fear of failure. 

While test-anxiety was not correlated 
with students’ perceptions of teachers’ in- 
fluence, it was related to aspiration setting 
in a way consistent with findings in previous 
research on the level of aspiration. Atkinson 
has found that persons with high test-anxi- 
ety tend to avoid moderate risks (Atkinson, 
1957). The present data show that students 
with high test-anxiety set aspirations far- 
ther from their present level of performance 
than do students with low test-anxiety. The 
goals set by the more anxious students were 
often unrealistically high. The high goals 
are taken to be unrealistic since it was noted 
that the farther students set their aspira- 
tions from their current grades in mathe- 
matics, the less they were likely to attain 
their levels of aspiration in their final grades 
for the course (r = -.63**).²


RESULTS 


How much were the students com- 
mitting themselves when they stated 
their levels of aspiration? To answer 
this question, aspired grades were 
compared with the grades actually 
received by students at the end of the 
semester. The substantial relationship 
between aspired grades and those re- 
ceived at the end of the semester (r =
.66**) suggests that the aspirations
were realistic among a majority of stu- 
dents. The aspired grades usually were 
set from one-third to one whole letter
grade higher than grades received for
immediately past performances, a
typical phenomenon in setting aspira-
tion levels indicating a desire for future
improvement.

² Correlation coefficients reported in text
and tables are marked by asterisks to indi-
cate the probability values at the .05 (*)
and .01 or less (**) levels of significance,
two-tailed test.

Our principal concern is in the re- 
lationships of the separate forms of 
power exerted by teachers (as students 
view these matters) with the congru- 
ence between students’ stated aspira- 
tions for grades and their perceived 
capacities. 

Perceived level of capacity was meas- 
ured by the query: 


Not everyone can get A for a final grade 
in mathematics. Many students know that 
they must get something less because every- 
one has an upper limit to his ability. What 
final grade do you think you could get if you 
worked to the limit of your ability and did 
the best you could in mathematics for the 
rest of the semester? 


Aspired grade was obtained by the
query:


Students do not always feel like doing 
their best in a certain class. Sometimes they 
are willing to accept a grade which is not as 
good as they could get if they really tried. 
The final grade you will get in your math 
class this semester will depend partly on how 
hard you are going to work for the rest of 
this semester. Considering how hard you 
plan to work, what final grade do you think 
you should get in math this year? 


Effects of Separate Bases of Power 


The most direct attempts to in- 
fluence students are based on the use of 
sanctions: by rewarding or coercing. 
Rewards are given or promised for 
behavior which is in accord with the 
wishes of the inducer. Coercion, based 
upon the ability to punish, is exerted or 
threatened for behavior that is not in 
accord with the wishes of the inducer. 
In a school, a teacher may administer 
sanctions in many ways: by the grades 
he gives, by comments or signs made to 
students, by exclusion from the group 


TABLE 1

CORRELATIONS BETWEEN FORMS OF POWER AND CONGRUENCE
(N = 415)

Form of power attributed to teacher         Congruence of aspiration and capacity
Comparative degree of rewarding sanction     .22**
Discriminate reward, frequency               .07
Indiscriminate reward, frequency             .03
Discriminate coercion, frequency             .09
Indiscriminate coercion, frequency          -.14**
Legitimacy of grading                        .22**
Expertness in grading                        .09
Referent status of teacher                   [illegible]

** p < .01.


or assignment of responsibilities in 
class, by reports to authorities or 
parents, and the like. In order to en- 
compass this variety of approving or 
disapproving cues, it was found neces- 
sary to cast questions about sanction- 
ing acts by teachers in terms of the ap- 
proval or disapproval that students 
perceived teachers have toward them. 
A general measure of the degree that
sanctioning was perceived as rewarding
or coercive was sought with the query:
“On the whole, how much do you feel
your math teacher is pleased (or
displeased) with you compared to the
rest of the class?” It was expected that
the greater the relative approval re-
ceived by the student, the greater
would be the congruence between his
aspired grade and his capacity grade.
In the first row of Table 1 it can be
seen that this expectation is supported.

Although reward and coercion can
be conceived as the extreme ends of a
single dimension, French and Raven
note that “the distinction [...]
because the dynamics of them are dif-
ferent.” Extreme punishment of a
person, for example, may lead him to 
avoid or escape the whole situation in 
which it operates, while receipt of a 
valued reward may make the situation 
more attractive to him. Thus, measures 
were made of the degree that teachers 
reward (“Does your math teacher 
seem to be pleased when you do your 
best?”) and of the degree that teachers 
coerce (“Does your math teacher seem 
displeased when you don’t try very 
hard and your work is not as good as it 
could be?”’). Each of these questions 
was answered on a frequency scale. 
Because the reward and coercion in 
these instances are being given where 
they are ordinarily taken to be appro- 
priate reactions, they are designated 
as discriminate sanctions to distinguish 
them from indiscriminate sanctions, 
discussed in a moment. On the second 
and fourth lines of Table 1 are shown 
correlations between frequency of dis- 
criminate sanctions and congruence. 
The low and nonsignificant correla- 
tions suggest that the degree of dis- 
criminate reward or coercion that 
teachers were perceived to use does not
affect the degree of congruency be- 
tween aspirations and capacity. 

Two further concepts relevant to
sanctioning behavior were investigated.
The first is indiscriminate reward: “Is
your math teacher ever pleased with 
your work even when you don’t try 
hard?” The second is indiscriminate 
coercion: “Does your math teacher 
ever seem to be displeased with you 
even when you do your best in class?” 
It can be seen in Table 1 that indis- 
criminate reward has no apparent ef- 
fect upon congruence while indiscrimi- 
nate coercion is inversely related to 
congruence. 
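
The significance markers attached to correlations of this size can be checked approximately. The sketch below is not the authors' procedure: it assumes the Fisher z transformation with a normal approximation, takes N = 415 from the table heading, and uses function names that are purely illustrative.

```python
import math


def fisher_z_statistic(r: float, n: int) -> float:
    """Approximate standard-normal test statistic for H0: population r = 0."""
    return math.atanh(r) * math.sqrt(n - 3)


def two_tailed_p(r: float, n: int) -> float:
    """Two-tailed probability under the normal approximation."""
    z = abs(fisher_z_statistic(r, n))
    return math.erfc(z / math.sqrt(2.0))


N = 415  # sample size shown in the table headings
for label, r in [
    ("indiscriminate coercion", -0.14),
    ("discriminate coercion", 0.09),
    ("indiscriminate reward", 0.03),
]:
    print(f"{label}: r = {r:+.2f}, approximate two-tailed p = {two_tailed_p(r, N):.3f}")
```

Under these assumptions a correlation of .14 in absolute value falls below the .01 level with 415 respondents, while .09 and .03 do not reach the .05 level, which agrees with the asterisks in Table 1.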

This last finding suggests that in-
discriminate coercion arouses stronger
tendencies to resist the teacher’s in- 


TABLE 2

CORRELATIONS BETWEEN FORMS OF SANCTION AND POWER, AND DESIRE TO CONFORM AND LEGITIMACY
(N = 415)

Form of power attributed to teacher    Desire to conform    Legitimacy
Discriminate coercion                   [illegible]           .06
Indiscriminate coercion                 -.19**               -.24**
  Difference**
Discriminate reward                      .29**                [illegible]
Indiscriminate reward                   -.11*                -.03
  Difference**

* p < .05.
** p < .01.


ductions, than to accept them. To test 
such an hypothesis, students were 
asked: “How often do you feel like 
doing the things your math teacher 
wants you to do?” Results relevant to 
this hypothesis are shown in the first 
column in Table 2. It is plain that stu- 
dents perceived themselves as less 
ready to conform to a teacher’s de- 
sires when coercion was indiscriminate 
than when it was discriminate. 

It has been found in other research 
that resistance to influence becomes 
greater as legitimacy of influence de- 
creases (see French and Raven). The 
legitimacy of social sanctioning stems 
from the perception, in those being 
influenced, that the influencer is be- 
having in accord with the internalized 
values of the ones being influenced. The 
degree of legitimacy attributed to 
teachers’ behavior was measured by 
the query: “How fair is your mathe-
matics teacher about most things?”
The correlations between legitimacy 
and the separate forms of sanctioning 
are shown in the second column of 
Table 2. It can be seen that discrimi- 
nate reward is reliably related to 
legitimacy but indiscriminate reward 
has no relationship with legitimacy.
Furthermore, indiscriminate coercion
is reliably associated with nonlegiti- 
macy while discriminate coercion has 
almost no association with legitimacy. 
We conclude that teachers who are 
considered discriminate in rewards are 
likely to be seen as “fair,” while those
who are indiscriminate in coercion are
likely to be seen as “unfair.”

French and Raven hypothesize that 
coercion arouses strong resistance in 
the recipient of it so that the inducer’s 
desires are not always acted upon by 
the recipients and, depending upon the 
strength of the resistance aroused, the 
recipients may instead be stimulated 
to do the opposite of what has been 
asked of them. This hypothesis has 
been corroborated by Zipf (1958) and 
Sampson (1960). The present findings 
(and those to be seen in Table 3) sug- 
gest that resistance to coercion in the 
school setting may more readily gen- 
erate negativism when the coercion is 
indiscriminate. 

Legitimate power, we have seen, 
stems from the perception that an in- 
fluencer is behaving in accord with the 
values of the person being influenced. 
In some instances acts by a teacher 
may be perceived as legitimate or non- 
legitimate without their being direct 
attempts to influence the student. An 
example of this type of legitimacy con- 
cerns the fairness of the teacher in 
evaluating the student’s work. The 
degree of this form of legitimacy attrib- 
uted to teachers was measured by the 
question: “If you did your best in math 
class would your teacher actually give 
you the grade that describes your 
ability?” It was expected that students 
who perceived their teacher as more 
legitimate would tend to set levels of 
aspiration closer to their perceived 
capacities since the risk of failure from 
unfair treatment by the teacher would 
be minimal. Evidence reported in the 
sixth row of Table 1 supports this
prediction. Confidence, then, that one
will receive the grade he earns if he 
works up to capacity (and not neces- 
sarily a high grade) is associated with 
tendencies to aspire to attain capacity. 
Expert power is based upon the per- 
ceived reliability of the influencer’s 
information. The more an informer is 
perceived as knowing what he is talking 
about, the more the informer is likely 
to influence the recipient of the infor- 
mation. Since we are assuming that 
teachers’ inductions are often placed 
upon the student in the direction of 
working up to capacity, it is evident 
that the teacher’s attempts to influence 
will be more acceptable if the student 
perceives that the teacher knows what 
the student’s capacity is. Thus, expert-
ness of the teacher was measured in one
way with the following question: “Do 
you think your math teacher is a good 
judge of your ability in mathematics?” 
It was expected that greater attribu-
tion of expertness to the teacher would
be associated with greater congruence 
between aspiration and capacity. In 
the realm of teaching, however, expert-
ness is also conceived as skill in the
substantive content of the course
being taught. A measure of this type
of expertness is the following: “How
much do you think your math teacher
knows about the subject he is
teaching?” Results reported
in Table 1 concern only expertness in
grading of students, since
the results with the previous measure
were not different in [...] re-
spects. The nonsignificant correla-
tions in the seventh
row of Table 1 suggest that the expert-
ness of the teacher does not create
greater congruence between aspira-
tions and capacity. The failure of
expertness to be related to congruence
might be explained by French and
Raven’s statement (1959) that


expert power results in primary social in- 
fluences on the person’s cognitive structure 
and probably not on other types of systems. 
Of course, changes in the cognitive structure 
can change the direction of forces and hence 
of locomotion, but such a change of behavior 
is secondary social influence (p. 163).


Referent power exists in an influencer 
when others desire to be like him. 
Students who are highly attracted to a 
teacher are likely to behave in ways of 
which he would approve, although they 
may not be aware of doing so. It was 
found offensive to students in pretests 
to inquire how much they desired to 
associate with or be like teachers. 
Therefore, measurement of this form 
of power was by means of the question:
“In general, how much do you like
your mathematics teacher as a per- 
son?” The prediction that greater 
referent power attributed to a teacher 
would be associated with greater 
congruence between aspiration level 
and perceived capacity was supported, 
as shown in the eighth row of Table 1. 

The effects of the various forms of 
power may be summarized by noting 
the positive effects of rewarding sanc- 
tions, legitimate power, and referent 
power on the congruence between aspirations
and capacity, and the negative
effect of indiscriminate coercion.
It is notable that the first three
forms are also significantly positively
related to one another; they often
appear simultaneously in
the teacher’s behavior and often supplement
one another, as proposed by
French and Raven. Indiscriminate
coercive power, on the other hand, is
negatively related to each of the other
forms. The first three forms of power,
we may add, are attributed to teachers
more often by students who attribute
high capacity to themselves, while
coercive power is attributed to teachers
more often by students who assign low
capacity to themselves. Nevertheless,


when perceived capacity is controlled, 
the relationships in Table 1 are not 
substantially lowered and retain their 
statistical significance—students with 
high ability not differing greatly from 
those with low ability. 


Student Performance 


An important consequence of social 
power is the degree that the separate 
forms of power motivate students to 
perform at the level of their aspirations. 
The relationship between the closeness 
of the aspired grades to the actual grade 
the student received at the end of the 
year, and the attribution of reward 
power to the math teacher was r = 
.24**. This relationship is significantly
greater than its relationship to coercive
power (r = -.08). Thus, positive
forms of influence appear to stimulate 
attainment of aspirations more than do 
coercive forms of influence. 


Valence and Difficulty of Attaining 
Capacity 

According to the theory of aspira- 
tion setting proposed by Lewin et al. 
(1944) one specific aspiration level, out 
of a number of possible alternatives, 
is likely to be chosen depending upon 
the degree that it is attractive but not 
too difficult to attain. We thus ex- 
pected to find students placing their 
levels of aspiration closer to their 
perceived capacity the more the capac- 
ity grades were valent for them and 
the less achievement of them was per- 
ceived as difficult. Valence of capacity 
was measured by the question: “How 
good do you think you would feel if 
you did get this grade?” Difficulty was 
measured with the query: “How hard 
would you have to work in order to 
receive this grade?” The expectation 
just stated was supported: valence of 
capacity grade is positively related to 
congruence (r = .12*), and difficulty
of attaining capacity grade is negatively
related to congruence (r =
-.27**). It is noteworthy that valence
and difficulty are positively related to
each other (r = .26**), supporting the
Lewin et al. (1944) and the Atkinson
(1957) findings that a difficult goal
tends to be more attractive than an easy
one.

We had expected to find that dif- 
ferent forms of power would have dif- 
ferent degrees of relationship to the 
valence the student attributed to the 
achievement of his capacity grade. A 
teacher who rewards a student for 
working at capacity, for example, 
might generate a greater desire in him 
to achieve capacity than a teacher who 
punishes him for not doing so. The 
statistical relationships between each 
form of influence and valence of reach- 
ing capacity were, however, consist- 
ently too low to be considered reliable. 
But the teachers’ total amount of 
power in attempting to influence stu- 
dents appears to affect students’ per- 
ceptions of the valence of the capacity 
grade. When all forms of power are 
considered together in a multiple cor- 
relation with valence, the multiple 
correlation between power and valence 
of the capacity grades is .27**. The 
nature of the contributions made by 
the separate forms of power, moreover, 
makes it appear likely that a teacher 
who employs several positive bases of 
power simultaneously, to support his 
inductions on a student to work up to 
capacity, will have greater effect upon 
the valence of doing so than a teacher 
who employs only one positive basis of 
power. Why and how social power can 
have effects upon the valence of a goal 
are problems worthy of future atten- 
tion. 

It seems reasonable that the closer a 
student sets his aspired grade to his 
perceived capacity, the more he will be 
satisfied in attaining this established 
aspiration. This contention was supported
by a correlation of .32** between
congruence and valence of
success, the latter measured by the
query: “How good would you feel if
you were given the grade you intend to
get?”


Desire to Conform 


A direct determination of the readi- 
ness of students to be influenced by 
teachers was sought by the use of two 
related concepts: perceived desire to 
conform and perceived desire nega- 
tively to conform. The former was 
measured with the question: “How often
do you feel like doing the things
your math teacher wants you to do?”
The latter was measured by the query:
“How often do you feel like doing the
opposite of what your math teacher
wants you to do?” All forms of power
together are strongly related to desire
to conform (multiple R = .56**).
Desire to conform, in turn, is related to
congruence (r = .20**).

The relationships between the sepa- 
rate forms of power and the desire to 
conform, or to do the opposite, are 
shown in Table 3. Consistent with
Table 1, indiscriminate coercion was
related to nonconforming desires, indiscriminate
coercion and reward were
negatively related to conforming desires,
and all other forms of power (including
expertness) were positively
associated with conforming desires.


Attitudes of Students toward Teacher and
Course


A final interest of this investigation
was in the relations between
power and attitudes toward ...
aspects of the ... . One item inquired about
the teacher (“... your opinion
of your mathematics teacher changed
from what it was at ...?”). The other
inquired about changes in attitude toward
the content of the course (“Do
you feel any different about mathematics
now than you did before you
... this math course?”). Both items
were scored in terms of direction of
attitude change as well as intensity of
change. In Table 4 correlations are
presented between these two attitudes
and the different forms of power attributed
to teachers. The evidence indicates
that each form of power affects
both of these attitudes in directions
similar to the effects we have seen for
aspirations and desires to conform,
and in directions suggested by
French and Raven.


TABLE 3
CORRELATIONS BETWEEN FORMS OF
POWER AND MOTIVATION
TO CONFORM
(N = 415)

Form of power attributed             Desire to    Desire negatively
to teachers                          conform      to conform

Comparative degree of rewarding      ...          ...
Discriminate reward, frequency       .29**        -.15**
Indiscriminate reward, frequency     -.11**       .09
Discriminate coercion, frequency     ...          ...
Indiscriminate coercion, frequency   -.19**       .16**
Legitimacy of grading                ...          ...
Expertness in grading                ...          ...
Referent status of teacher           .53          -.29*

** p < .01.


TABLE 4
CORRELATIONS BETWEEN FORMS OF
POWER AND CHANGES
IN ATTITUDES
(N = 415)

Form of power attributed             Attitude     Attitude
to teachers                          toward       toward
                                     teacher      course

Comparative degree of rewarding      ...          ...
Discriminate reward, frequency       ...          ...
Indiscriminate reward, frequency     ...          ...
Discriminate coercion, frequency     ...          ...
Indiscriminate coercion, frequency   -.22**       -.21**
Legitimacy of grading                .17**        ...
Expertness in grading                ...          ...
Referent status of teacher           ...          .40**

** p < .01.


Results from English Classes 


The results we have observed
concerning mathematics
were not completely replicated
when students were queried about
their English teachers. Similar correlations
were found in mathematics
and English when relating forms of
power to desires for conformity and to
attitude changes toward the teacher
and the course content. Only legitimate
power, however, was significantly
related to the congruence of
aspired and capacity grades in the 
English classes. The fact that legiti- 
mate power appears to have positive 
effects in both mathematics and Eng- 
lish is understandable since legiti- 
macy was rated by the students as the 
most effective source of a teacher's 
power in response to an inquiry into 
the relative importance of teachers, 

parents, and peers as power figures. 
The failure of the other bases of 
power among English teachers to be 
related to congruence, however, re- 
quires further explanation. The original 
reason for the inclusion of questions 
about English teachers was the expecta- 
tion that students would be less certain 
of the nature of good performance in 
English courses than in mathematics 
courses. We anticipated that influence
might be more effective in English
classes because students would be less
clear about their capacities in that subject
matter and therefore less confident
of appropriate aspirations for themselves,
thus being more vulnerable to
influences from teachers (cf. Festinger, 
1954). Contrary to our expectations, 
more uncertainty about the appro- 
priateness of their aspirations was 
shown in mathematics than in English, 
as indicated by responses to the ques- 
tion: “Do you think you are aiming 
for too high or too low a grade?” Com- 
parisons, furthermore, of responses to 
questions concerning the degree that 
teachers reveal their reactions to good 
or bad performances and the adequacy 
of help rendered by teachers showed no 
differences between mathematics and 
English classes. We were led, there- 
fore, to suspect that the apparent in- 
effectiveness of influence attempts in 
English classes was due not to the 
course material itself or to the methods 
of teaching, but to the motivations of 
students. 

An indicator of student concern over 
performance was available in the pre- 
viously mentioned measure of valence 
of success. In mathematics classes, all 
forms of influence except coercion were 
significantly related to the valence of 
successfully achieving the aspired 
grades. In English classes, however, 
only referent power showed this re- 
lationship to a significant degree. It 
appears, then, that the students were 
more eager to do well in mathematics 
than in English. Further weight is lent 
to this interpretation by responses to 
the question: “Which class do you 
think is more important for your 
future?” In the replies, 34 students 
favored English, 152 preferred mathe- 
matics, and the rest saw them as 
equally important.* It is interesting to
note that in the literature on level of
aspiration, experimental predictions
are better supported the more that
subjects have ego involvement in their
tasks (Lewin et al., 1944; Stotland,
Thorley, Thomas, Cohen, & Zander,
1957).


* The questionnaire was administered to
students during the spring of 1959, that is,
during the post-Sputnik emphasis on mathematics
and physical science.


Summary and Conclusions


A questionnaire was used to explore
the effects of teachers’ influences upon
students’ aspirations for achievement 
in school. Hypotheses drawn from 
earlier work on the differential con- 
sequences of separate types of social 
power were tested in a correlational 
analysis. 

1. Tendencies to accept a teacher’s 
influences are aroused in students who 
are subject to reward, legitimate, 
referent, or expert power; while tenden- 
cies to ignore or oppose what teachers 
desire are aroused in students subject 
to indiscriminate coercive influences. 

2. With the possible exception of 
expert power, these tendencies affect 
the degree to which students set their 


aspired grades congruent with their 
perceived capacities.


3. Two forms ... distinguished by ...
inadequate ... seems to ...
aspiration setting as well as future
performance.

4. Two forms of ... influences are
discriminated ... indiscriminate reward ...
reward for adequate performances ...
but increased ...

5. The positive or negative forces set
up by the separate bases of power are
effective in determining aspirations to
the degree that the students are ego
involved in the performances on which
they are setting aspirations.


References


ATKINSON, J. Motivational determinants of
risk-taking behavior. Psychol. Rev.,
1957, 64, 359-372.

CARTWRIGHT, D. Power: A neglected variable
in social psychology. In D. Cartwright
(Ed.), Studies in social power.
Ann Arbor, Mich.: Inst. Soc. Res., 1959.

FESTINGER, L. Wish, expectation, and group
standards as factors influencing level of
aspiration. J. abnorm. soc. Psychol.,
1942, 37, 184-200.

FESTINGER, L. A theory of social comparison
processes. Hum. Relat., 1954, 7, 117-140.

FRENCH, J. R. P., JR., & RAVEN, B. The
bases of social power. In D. Cartwright
(Ed.), Studies in social power. Ann Arbor,
Mich.: Inst. Soc. Res., 1959.

LEWIN, K., DEMBO, TAMARA, FESTINGER,
L., & SEARS, PAULINE. Level of aspiration.
In J. McV. Hunt (Ed.), Personality
and the behavior disorders. New
York: Ronald, 1944.

MANDLER, G., & SARASON, S. B. A study of
anxiety and learning. J. abnorm. soc.
Psychol., 1952, 47, 166-173.

SAMPSON, E. An experiment on active and
passive resistance to social power. Unpublished
doctoral dissertation, University
of Michigan, 1960.

STOTLAND, E., THORLEY, S., THOMAS, E.,
COHEN, A., & ZANDER, A. The effects of
group expectation and self-esteem upon
self-evaluation. J. abnorm. soc. Psychol.,
1957, 54, 55-63.

ZIPF, SHEILA. An experimental study of resistance
to influence. Unpublished doctoral
dissertation, University of Michigan,
1958.


(Received August 16, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 1, 12-18 


ASSESSING EMERGING LEADERSHIP BEHAVIOR 
IN SMALL DISCUSSION GROUPS 


MARVIN TAYLOR, ROBERT CROOK, AND STANLEY DROPKIN
Queens College, New York 


The definition of leadership in in- 
teractional terms as typified by Gibb’s 
(1947) discussion has created a new 
frame of reference for the description 
of leadership in behavioral terms. Instead
of seeking static “traits of leadership”
in an individual, investigators
have been seeking criteria of “leader- 
ship behavior.” For the purposes of 
the present study leadership may be 
defined as the influence of an individual 
in interaction with other individuals 
within a group setting. In recent years 
many promising technics have been 
developed for studying leadership 
behavior. Findings reported by the 
Ohio State Leadership Study Group 
suggest an important concept, namely, 
that leadership acts may be thought of 
as “initiating structure in interaction” 
and “showing consideration” (Halpin, 
1956a, 1956b). What appears to be of 
particular significance is the finding 
that these two dimensions may be use- 
ful in distinguishing between effective 
and ineffective leaders. 

The Leadership Behavior Descrip- 
tion Questionnaire (Halpin, 1956b), a 
paper-pencil instrument, developed by 
the Ohio State group is useful in ob- 
taining descriptions of individuals 
already in leadership positions. Such a 
technic is limited in its usefulness for 
assessing emerging leadership. An 
instrument constructed for observation 
of emerging leadership patterns would 
be particularly useful in exploring the 
relationship between various group-situational
factors and successful
leadership behavior. Such assessments
would call for “on the spot” observations
of behavior and their categorization
according to a set of predetermined
criteria.

This paper describes an attempt to 
observe individuals in small leaderless 
groups and to categorize the emerging 
behavior employing the two dimensions
of leadership behavior mentioned
above. Before it is possible to estimate 
the predictive validity of such an ap- 
proach, it appears necessary to estab- 
lish that: (a) a group of raters can be 
trained to agree consistently about the 
classification of observed behaviors, 
(b) the behaviors observed are stable 
and consistent from one situation to
another, and (c) the categories employed
are sufficiently independent of
each other to yield pertinent information
about each individual in accordance
with the underlying concept of
effective leadership.


Method

Sample. The subjects were all students in
the graduate division ... .
Each subject was observed participating
in two 30-minute, six-man discussion
situations. The 1958 sample
was composed of 32 subjects; in 1959
there were 37 subjects ... independently ...



The Rating Instrument. The theoretical
concept of leadership underlying the rating
instrument is indicated in the introductory
comments. Its major emphasis is on those
behaviors of members of discussion groups
which may be classified according to the
categories, described by Halpin (1956a) as:
“initiating structure in interaction” and
“showing consideration.” In some of the
... experimental data obtained by the
... and in a later formulation by Hemphill
... it was indicated that a
further distinction should be made between
attempts to initiate structure and success in
doing so. In the present technic, the
latter behaviors are recorded as influence.
In recording an individual’s behavior, if
he ... identifies a problem, suggests
... for finding a solution to a
problem, or opens a channel of communication
to another person, he is accorded a
tally in the category “attempts to initiate
structure.” If there was some overt acknowledgment
by another group member of
the “attempt to initiate structure,” the
individual is accorded an influence tally. If
the individual identifies himself as a member of
the group, supports another’s point of view,
... of another’s feelings or
..., he receives a plus tally in the
category “shows consideration.” If, on the
other hand, he is sarcastic, caustic, or attacks
another group member, a minus tally
is recorded. Thus, in this area, a group member
may receive both positive and negative
tallies.
Procedure. Each of the subjects was initially
assigned at random to a discussion
group. In 1958, assignments to the second
discussion group were based on the principle
that the group members should be new
to each other. In 1959, the group membership
was held constant in both situations.
... all conditions were standardized.
Based on previous research (Wilson
& Robbins, 1955), six-person discussion
groups were maintained wherever possible.¹
Each discussion group was assigned the
same discussion problem when the members
were seated in the one-way vision room.
Each session lasted 30 minutes.


¹ Due to scheduling difficulties it was not
possible to have all groups with six persons.
The majority of the groups did have this
number; however, one group had four members
and another seven. Research suggests
that although six is an optimal number of
participants, there are not statistically significant
differences between groups when
... seven people are observed (Wilson &
Robbins, 1955).

Scores. Although an individual had to 
attempt to initiate structure (Category 1) 
before he could influence the structure 
(Category 2) these acts theoretically repre- 
sent different types of behavior. The scores 
in Categories 1 (attempts to initiate struc- 
ture) and 2 (influence in initiating structure) 
were obtained by summing the number of 
tallies recorded for each subject by each 
rater. The “score” for each subject in
Category 3 (shows consideration) was the 
difference between positive and negative 
tallies. Rater reliability, subject consist- 
ency and stability have been estimated for 
each category and are reported below. 
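
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the tally labels and the function name `category_scores` are hypothetical, since the paper names the three categories but does not describe a recording format.

```python
from collections import Counter

# Hypothetical tally labels (not from the paper): one entry per act
# that a single rater recorded for a single subject.
ATTEMPT = "attempt"                      # Category 1 tally
INFLUENCE = "influence"                  # Category 2 tally
CONSIDERATION_PLUS = "consideration+"    # Category 3, positive tally
CONSIDERATION_MINUS = "consideration-"   # Category 3, negative tally


def category_scores(tallies):
    """Turn one rater's tallies for one subject into three category scores.

    Categories 1 and 2 are plain counts of tallies; Category 3 is the
    difference between positive and negative consideration tallies.
    """
    counts = Counter(tallies)
    return {
        "attempts": counts[ATTEMPT],
        "influence": counts[INFLUENCE],
        "consideration": counts[CONSIDERATION_PLUS] - counts[CONSIDERATION_MINUS],
    }


# Made-up tallies for illustration only.
example = [ATTEMPT, ATTEMPT, INFLUENCE, CONSIDERATION_PLUS,
           CONSIDERATION_PLUS, CONSIDERATION_MINUS, ATTEMPT]
scores = category_scores(example)
```

Because the consideration score is a difference, it can be negative for a subject who receives more minus than plus tallies.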

Analysis of Data. The analysis of variance 
technique was utilized to estimate the sub- 
ject consistency, subject stability, and rater 
agreement coefficients for each leadership 
behavior category.² Initially, coefficients of
rater agreement were obtained for each 
problem situation as well as for both situa- 
tions combined (cf. Table 1). On the basis 
of these coefficients of rater agreement it 
was decided to combine the subject’s ratings 
within each leadership behavior category 
over both problem situations. These com- 
bined ratings were treated to estimate the 
subject consistency (reliability) and sta- 
bility coefficients for each leadership be- 
havior category. Since the problem situa- 
tion conditions (e.g., changing membership) 
were different for the 1958 and 1959 samples 
the data are analyzed independently.

The coefficients of rater agreement in
each situation for each leadership behavior
category were estimated by the formula:

$$R_{\text{agreement}} = (S_s^2 - S_e^2)/S_s^2$$

where $S_s^2$ equals the between subjects mean
square and $S_e^2$, the residual error mean
square.

The coefficients of rater agreement
for combined situations were derived by the
formula:

$$R_{\text{agreement 1.2}} = (S_s^2 - S_{sr}^2)/S_s^2$$

In this formula the $S_{sr}^2$ stands for the variance
error due to subjects by raters interaction.
This coefficient may be interpreted
as the correlation between ratings of N
judges and ratings of N other judges on
these same performances.


² The writers wish to express thanks to
Donald Medley, Division of Teacher Education,
Bureau of Research, Board of Higher
Education of New York City, for his help in
the statistical analysis of the data.


The coefficients of subject consistency
(reliability) for each Leadership Behavior
Category were estimated by using:

$$R_{\text{cons.}} = (S_s^2 - S_{sp}^2 - S_{sr}^2 + S_e^2)/S_s^2$$

The only new term added here, $S_{sp}^2$, equals
the variance from errors due to the subjects
by problems interaction. These coefficients
may be interpreted as the correlations between
ratings by N judges on two problems
and ratings by N judges on two other problems.
In obtaining the coefficient of stability,
the formula was:

$$R_{\text{stab.}} = (S_s^2 - S_{sp}^2)/S_s^2$$

This may be interpreted as the correlation
between ratings on two problems and ratings
on two other problems by the same
judges. The assumptions underlying the
derivations of the various formulae were
based on Model II considerations as discussed
by Medley, Mitzel, and Doi (1956).
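
The coefficient formulas above are simple ratios of analysis-of-variance mean squares, and a short Python sketch may make the arithmetic concrete. Everything here is an illustrative assumption rather than the authors' procedure: the function names are hypothetical, the input numbers are made up, and `mean_squares` covers only the single-situation case (a subjects by raters layout with one rating per cell).

```python
def mean_squares(ratings):
    """Between-subjects and residual mean squares for one situation.

    ratings[i][j] is the score given to subject i by rater j
    (two-way layout, one observation per cell).
    """
    n, k = len(ratings), len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    subj_means = [sum(row) / k for row in ratings]
    rater_means = [sum(ratings[i][j] for i in range(n)) / n for j in range(k)]
    ss_subj = k * sum((m - grand) ** 2 for m in subj_means)
    ss_rater = n * sum((m - grand) ** 2 for m in rater_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_resid = ss_total - ss_subj - ss_rater
    return ss_subj / (n - 1), ss_resid / ((n - 1) * (k - 1))


def agreement(ms_s, ms_e):
    # Single situation: (S_s^2 - S_e^2) / S_s^2
    return (ms_s - ms_e) / ms_s


def agreement_combined(ms_s, ms_sr):
    # Combined situations: (S_s^2 - S_sr^2) / S_s^2
    return (ms_s - ms_sr) / ms_s


def consistency(ms_s, ms_sp, ms_sr, ms_e):
    # (S_s^2 - S_sp^2 - S_sr^2 + S_e^2) / S_s^2
    return (ms_s - ms_sp - ms_sr + ms_e) / ms_s


def stability(ms_s, ms_sp):
    # (S_s^2 - S_sp^2) / S_s^2
    return (ms_s - ms_sp) / ms_s


# Made-up ratings: three subjects, two raters who differ only by a constant.
toy = [[1, 2], [3, 4], [5, 6]]
ms_subjects, ms_residual = mean_squares(toy)
toy_agreement = agreement(ms_subjects, ms_residual)
```

In the toy data the two raters order the subjects identically, so the residual mean square is zero and the agreement coefficient reaches its maximum of 1. A constant difference between raters does not lower the coefficient, because that variation falls in the rater term rather than the residual.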


Results


Coefficients of rater agreement rele- 
vant to each Leadership Behavior 
Category for each problem situation 
and for situations combined are pre- 
sented in Table 1. The coefficients for 
“Attempts to initiate structure” 
ranged from .52 to .74 in 1958 for a 
sample of 32 subjects and from .73 to 
.89 in 1959 for a sample of 37. The 
relationship between observer ratings 
for the category “Success in initiating
structure” ranged from .81 to .82 in 
1958 and from .83 to .88 in 1959. On the 
“Shows consideration” category the 


coefficients ranged from .73 to .92 and 
from .70 to .92 in 1958 and 1959, 
respectively. All the coefficients pre- 
sented are statistically significant well 
beyond the .01 level of confidence. 
Comparing the 1958 estimates of rater 
agreement with those of 1959, small 
increments in the latter coefficients 
may be noted. This may be due to in- 
creased sampling, or to the fact that at 
least two of the judges had had an 
additional year of training and experi- 
ence. 

Subject consistency and stability
within each Leadership Behavior Category
over two problem situations are
presented in Table 2. The consistency
coefficients range from .07 to .51 in
1958 and from .58 to .92 in 1959. The
coefficients pertinent to stability of
performance ranged from .07 to .51
for 1958 and from .59 to .85 in 1959.
In 1959 all the coefficients were statistically
significant beyond the .01 level
of confidence. It may be noted that in
all categories the estimates of consistency
and stability are considerably
higher for the 1959 data than for the
1958 data. More will be said about
this below.


TABLE 1
COEFFICIENTS OF RATER AGREEMENT FOR EACH
DISCUSSION SITUATION AND
OVER COMBINED DISCUSSION SITUATIONS

                                   Situation 1         Situation 2         Combined
Leadership Behavior Categories     1958      1959      1958      1959      1958      1959
                                   (N = 32)  (N = 37)

Attempts to initiate structure     ...       .89       .52       ...       ...       ...
Success in initiating structure    .82       .85       ...       ...       ...       ...
Shows consideration                .73       .70       ...       ...       ...       ...

Note.—All coefficients are significant at the .01 level of confidence.


TABLE 2
COEFFICIENTS OF STABILITY AND CONSISTENCY
FOR THREE LEADERSHIP
BEHAVIOR CATEGORIES OVER
COMBINED DISCUSSION
SITUATIONS

                                   Consistency         Stability
Leadership Behavior Categories     1958ᵃ     1959ᵇ     1958ᵃ     1959ᵇ

Attempts to initiate structure     .43       .58*      .37*      .59*
Success in initiating structure    .07       .74*      .07*      .77*
Shows consideration                .51*      .92*      .51*      .85*

ᵃ Group membership in each discussion situation was
changed.
ᵇ Group membership in each discussion situation was
the same.
* Significant at the .01 level of confidence.


Table 3 presents correlation coefficients
pertinent to the degree of
independence, or overlap, between
each of the Leadership Behavior
Categories. These coefficients were
computed, by the Pearson product-moment
method, for the 1959 data only
and are based on the combined ratings
for each subject. Only the correlation
between the Success and Consideration
categories was found to be significant
beyond the .01 level of confidence
(r = .65). The coefficients
between Attempts and Success (r = .03)
and Attempts and Consideration (r =
.00) suggest little or no relationship.


TABLE 3
RELATIONSHIP BETWEEN LEADERSHIP BEHAVIOR
CATEGORIES (COMBINED
SITUATIONS) AND PERCENTAGE
OF COMMON VARIANCE

                                             Percentage
Leadership Behavior Categories   r     N     of Overlap

Attempts/Success                 .03   37    00
Attempts/Consideration           .00   37    00
Success/Consideration            .65   37    41
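
The "percentage of overlap" reported with these correlations is conventionally the squared correlation expressed as a percentage. The sketch below shows that computation in plain Python. The function names and the toy data are assumptions for illustration, and the paper does not state how the tabled percentages were rounded or whether unrounded correlations were used.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def percent_common_variance(r):
    """Share of variance two measures have in common: 100 * r squared."""
    return 100.0 * r * r


# Made-up combined ratings for four subjects on two categories.
toy_r = pearson_r([1, 2, 3, 4], [2, 4, 6, 8])
overlap_success_consideration = percent_common_variance(0.65)
```

Squaring the rounded value r = .65 gives roughly 42%, close to the 41% tabled; the small difference may reflect rounding of the correlation before it was printed.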


Discussion 


The findings of the present study
may be examined in terms of the three
issues raised in the introduction to this
paper.


Rater Agreement 


The data presented in Table 1 seem 
to support the notion that raters 
(observers) can be trained to observe
an individual’s behavior in a small 
group setting and agree consistently 
with other observers about which acts 
demonstrate “attempts to initiate 
structure,” “success (or influence) in 
initiating structure,” or “shows con- 
sideration.” This finding in itself is not 
particularly significant, having been 
demonstrated any number of times in 
the literature (Bales, 1950; Bass, 1954). 
What may prove of some importance, 
however, is that the raters agreed 
about acts which are theoretically 
related to effective leadership as dem- 
onstrated by the Ohio State studies of 


leadership. 


Independence of Leadership Categories 


Although the raters agree with each 
other with a high degree of consistency 
it is important to determine the magni- 
tude of the relationship among the 
three Leadership Behavior Categories. 
One may ask: “Are the raters actually
discriminating between the various
types of behavior observed?” Table 3
presents data which suggest that there
is probably no overlapping between
“attempts” and “consideration” and
between “attempts” and “success or
influence.” The relationship between
“success” and “consideration” indicates
that there is approximately 41%
overlap in variances. Thus one may
conclude tentatively that each category
is somewhat independent and may
be measuring important attributes of
effective leadership. On the basis of the
Halpin (1956a) work on effective
leadership it may be postulated that a
portion of the relationship between
“success” and “consideration” may be
due to a third variable, i.e., effective
leadership. It seems reasonable to
contend that the three Leadership
Behavior Categories are probably independent
enough of each other to be
useful measures.

From another point of view, it may 
be argued that an individual’s “attempts
to initiate structure” have little
to do with whether or not he shows
consideration toward the other group
members. On the other hand, one
must not conclude that “attempts”
and “success” are not related. The
present scoring system minimizes the
possibility of correlation between “attempts”
and “influences.” At present
a new scoring system is being sought,
but results are not yet conclusive.
Comparison of the estimates of rela- 
tionship between the three categories, 
however, seems to Support the argu- 
ment that “successful initiation of 
structure (i.e., influence on the group 
problem solving procedures)” is related 
to consideration shown by the influen- 
tial individual to other group members. 
Although one may try to influence his 
peers, apparently one must also show 
consideration for them if he is to suc- 
ceed in influencing the group problem 
solving procedures. 


Stability and Consistency 


If any instrument is to prove useful
for predictive purposes or for making
generalizations, it is necessary to assess
behaviors which are stable and consistent.
Analysis of the 1959 data suggests
that subject performance is
recorded as reasonably stable and consistent
in all three Leadership Behavior
Categories by the four raters. It is
interesting to note that the AIS
(attempts to initiate structure) acts
are observed as less stable and consistent
than acts in the other two categories.
The major source of error
variance influencing all of the coefficients
of stability and consistency
(this is true for 1958 and 1959 data) is


the subject × problems interaction. It
may be hypothesized that the major
source of variation influencing the S ×
P interaction term involves the constancy
of the group membership and
the nature of the problem presented in
each situation. In the case of the success
and consideration categories the
coefficients of stability and consistency
are increased considerably in 1959 by
holding group membership constant
and presenting two similar problems.
Examination of the coefficients relevant
to the AIS category indicates
that the magnitude of the increase
is considerably less than for the Success
or Consideration categories. The
present design does not permit a
clear explanation of these findings. It
seems logical to contend, however,
that a person has more cognitive control
over his attempts to initiate structure
than his behavior in the other two
categories. The “success or influence”
behavior is, in reality, a measure of
the other group members’ reactions to
Individual A’s attempts to initiate
structure. Hence, Person A has little
“control” over the stability of these
“acts” except as he chooses not to attempt
to initiate structure. The “attempts
to initiate structure” acts appear
to be primarily cognitively
oriented, whereas the consideration
acts appear to be basically noncognitive.
If this hypothesis is tenable then
Person A could not control the noncognitive
acts as directly as he could
control the “attempts” he makes to
initiate structure. On the basis of the
data presented in Table 2 it seems
probable that the individual who is
perceived as considerate and who attempts
to initiate structure during the
first group session has a higher probability
of success than the individual
who is not seen as considerate. It seems
likely that the individual who was successful
during the first group session
will continue to be successful during a
second situation if the group membership
is unchanged and the nature of the
problems is similar. On the other hand,
the individual who attempts to initiate
structure but is not influential (for
whatever reason) must change his
behavior in the next group discussion.
Thus he may either make more or
fewer attempts during the next session.
Thus, his “attempts” behavior would
be less stable and consistent. Such
a hypothesis is, of course, extremely
tentative but may indicate some of the
interesting problems about which the
present technic may help to provide
more information.

This instrument may have considerable use for both research purposes and more pragmatic ends. For example, the classification of leadership behavior into three categories permits clarification of the relationships between various leadership acts and other measures. A second project would be to find the relationship between various group dimension variables and leadership behavior categories. This technic might be useful in designing selection procedures where vocational success is highly related to effective leadership behavior.

The present instrument seems to satisfy the criteria for content validity, rater reliability, and overall reliability. The next research steps are to find out whether this technic also satisfies the criteria for construct, concurrent, and predictive validity.


Summary 


Thirty-two subjects in 1958 and a further group of subjects in 1959 were observed in leaderless group discussions by trained observers. Each subject’s leadership behavior was categorized as “attempts to initiate structure,” “success in initiating structure,” or “shows consideration.”

The findings suggest: (a) Raters can 
be trained to agree consistently about 
the classification of leadership behaviors
observed during a group discussion.
(b) The behaviors in the “success” and 
“consideration” categories are highly 
stable and consistent from one discus- 
sion situation to another when the 
group membership is held constant 
and when problems of a similar nature 
are being discussed. When these con- 
ditions do not prevail the leadership 
behaviors are not particularly stable or 
consistent. When the group member- 
ship is held constant the attempts to 
initiate structure acts are not as 
stable or consistent as the other two 
types of acts. (c) The three categories 
seem sufficiently independent to be 
considered as three distinct types of 
observed behaviors. The correlation 
between success and consideration 
seems to suggest a mediating variable,
i.e., effective leadership. 

The technic is seen to have potential 
value for leadership—group situa- 
tional—organismic variable research. 
Considerably more work is necessary 
to determine whether the instrument 
satisfies the criteria for predictive, 
concurrent, and construct validities. 


REFERENCES 


Bales, R. F. Interaction process analysis: A method for the study of small groups. Cambridge, Mass.: Addison-Wesley, 1950.

Bass, B. M. The leaderless group discussion as a leadership evaluation instrument. Personnel Psychol., 1954, 7, 470-477.

Gibb, C. A. The principles and traits of leadership. J. abnorm. soc. Psychol., 1947, 42, 268.


18 MARVIN TAYLOR, ROBERT CROOK, AND STANLEY DROPKIN 


Halpin, A. W. The behavior of leaders. Educ. Leadership, 1956, 14, 172-176. (a)

Halpin, A. W. The leadership behavior of school superintendents. Columbus: Ohio State Univer., 1956. (b)

Medley, D. M., Mitzel, H. E., & Doi, A. N. Analysis of variance models and their use in a three-way design without replication. J. exp. Educ., 1956, 24, 221-229.

Wilson, Phyllis C., & Robbins, I. An evolving technique for appraising interpersonal effectiveness. J. teach. Educ., 1955, 6, 233-238.

(Received March 31, 1960)


Journal of Educational Psychology 
1961, Vol. 52, No. 1, 19-24 


THE EFFECT OF ATTITUDES ON PERFORMANCE 
IN MATHEMATICS 


LEWIS R. AIKEN, JR. 


Woman's College, University of North Carolina 


AND RALPH MASON DREGER¹


Jacksonville University 


Recent work (Dreger & Aiken, 1957; Dutton, 1954; Gough, 1954; Poffenburger & Norton, 1956; Tulock, 1957) suggests that performance in mathematics is influenced by nonintellective as well as intellective variables. Mathemaphobia (Gough, 1954), i.e., pronounced fears in the presence of arithmetic and mathematics, and other negative attitudes toward mathematics demand explanation. The simplest explanation is that such reactions result from experiences specific to the learning of mathematics, in particular that the manner in which significant others, viz., teachers and parents, instruct children in mathematics is the primary determinant of their attitudes toward this subject, referred to here as “math attitudes.” The present investigation provides a limited test of the direct experience hypothesis of the etiology of math attitudes by studying, first, relations between selected intellective and nonintellective variables and math attitudes and, second, the contributions of these attitudes to the prediction of achievement in mathematics.


HYPOTHESES


Relation of Math Attitudes to Achievement Measures


Final mathematics course grades. Math attitude scores make a significant contribution to the prediction of final grades in a mathematics course.

Mathematics achievement test changes. Math attitude scores predict gains in scores from initial to final administration of a mathematics achievement test when training has intervened.

¹ The authors wish to thank Lyle V. Jones of the University of North Carolina Psychometric Laboratory for his help in planning and implementing this investigation.


Relation of Math Attitudes to Person- 
ality Measures 


Temperament. Math attitudes are 
unrelated to specified “general person- 
ality” variables. 

Ability. Math attitudes are positively correlated with numerical ability.


Relations of Math Attitudes to Experi- 
ences with Mathematics 


Ratings of mathematics teachers. 
Math attitudes are positively cor- 
related with subjects’ ratings of former 
mathematics teachers. 

Reported parental encouragement.
(a) Math attitudes are positively cor- 
related with subjects’ reports of early 
parental encouragement of mathe- 
matical endeavors. (b) Math attitudes 
are unrelated to subjects’ reports of 
encouragement of studying academic 
subjects in general. 

Reported parental attitudes toward 
mathematics. Math attitudes are posi- 
tively correlated with subjects’ reports 
of parents’ own math attitudes. 

Reported traumatic experiences with mathematics. Math attitudes from favorable to unfavorable are correlated negatively with the number of frustrating or embarrassing situations associated with mathematics.


METHOD


Measures. Paragraphs describing atti- 
tudes toward mathematics written by 310 
college students were reduced to scaled 
items according to Likert’s procedure (Ed- 
wards, 1957) to constitute the basis for the 
Math Attitude Scale. The final scale con- 
sisted of 10 items connoting negative atti- 
tudes and 10 connoting positive. Sample 
items are:²

8. Mathematics makes me feel uncom- 
fortable, restless, irritable, and impatient. 

13. I approach math with a feeling of 
hesitation—hesitation resulting from a 
fear of not being able to do math. 

18. I love mathematics, and I am hap- 
pier in a math class than in any other 
class. 

5. Mathematics makes me feel secure, 
and at the same time it is stimulating. 
Preliminary investigation using this scale attested to its reliability (r = .94 for test-retest). In addition, a test of independence between the scores on the attitude scale and scores on four items designed to measure attitudes toward academic subjects in general suggested that attitudes specific to mathematics were being measured (χ² = .80, df = 1).

Besides the main nonintellective measure, the Math Attitude Scale, the Minnesota Counseling Inventory (Berdie & Layton, 1957) and the Intensive Personal Data Sheet were selected to assess nonintellective variables. The MCI was chosen not only because as a group inventory it met time requirements but also because it appears to incorporate some of the better features of its well-known cousins, the MMPI and the Minnesota Personality Scale, and to assess variables expected to relate to academic performance. The IPDS, developed at the University of Southern California, was adapted for use in this study. Intellective measures employed were three of the Differential Aptitude Tests (Verbal Reasoning, Numerical Ability, and Abstract Reasoning; Bennett, Seashore, & Wesman, 1952), the Cooperative Mathematics Pretest for College Students (Mathematical Association of America, 1947), high school mathematics averages, and final grades in college freshman mathematics.

² The Math Attitude Scale has been deposited with the American Documentation Institute. Order Document No. 6545 from ADI Auxiliary Publications Project, Photoduplication Service, Library of Congress, Washington 25, D.C., remitting in advance $1.25 for microfilm or $1.5 for photocopies. Make checks payable to Chief, Photoduplication Service, Library of Congress.

Subjects and Procedures. On the basis of 
their scores on the mathematics pretest, 
administered during orientation week, en- 
tering freshmen at a southeastern college 
who elected their mathematics for the fall 
semester were assigned to general
mathematics, intermediate algebra, or college
algebra classes. Most of the analyses were 
carried out on the five sections of general 
mathematics. All the data beyond the pre- 
test were collected during the first few 
meetings of the classes, except in the case 
of the second administration of the mathe- 
matics pretest which took place one week 
before the final examinations. Data were 
analyzed primarily by means of multiple 
and partial correlation and regression meth- 
ods. Hypotheses were tested for males and 
females separately inasmuch as a pilot study 
had indicated the possibility of sex differ- 
ences in math attitudes. 


RESULTS


Math Attitudes and Achievement Measures


Final Course Grades. Multiple regression analyses of the predictive value of the Math Attitude Scale were made with the 60 males and 67 females taking general mathematics. On the basis of the intercorrelations among the five intellective predictor variables and the criterion variable (final grades), regression analyses were restricted to the independent variates of high school mathematics average, DAT Verbal Reasoning, DAT Numerical Ability, and the Math Attitude Scale. The multiple correlation coefficients were .67 and .63 for males and females, respectively (p < .01). For cross-validation, the predictor equations for students in general mathematics were applied to the scores on the independent variates of students taking algebra. The obtained and predicted grades were correlated .69 for the 42 males and .65 for the 20 females. Tests of significance of the partial regression coefficients in the original equations showed that, for the males, all variables except Math Attitude made significant contributions. Only Math Attitude and DAT Numerical Ability played significant roles in the predictor equation for the females (see Table 1). Thus, the hypothesis of significant contribution of math attitudes to prediction of achievement is borne out for females, but not for males.

TABLE 1
Tests of Significance of Partial Regression Coefficients in Multiple Regression of Differential Aptitude Test Variables, Math Attitude, and High School Math Average on Final Grades in General Math

                                    Values of t
Variable                       Males        Females
                              (N = 60)     (N = 67)
High School Math Average       4.12**       1.23
Math Attitude Scale            1.18         3.16**
DAT Verbal Reasoning           4.07**        .91
DAT Numerical Ability          5.09**       2.01*

* Significant beyond the .05 level.
** Significant beyond the .01 level.
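As a rough check on the multiple correlations reported for final grades, the overall F ratio implied by each R can be computed from R, the number of cases, and the number of predictors. The sketch below assumes the four predictors named in the text and the usual formula F = (R²/k) / ((1 − R²)/(N − k − 1)); the function name is illustrative and is not part of the original analysis.

```python
def f_from_multiple_r(r, n, k):
    """Overall F ratio for a multiple correlation r with n cases and k predictors."""
    r2 = r * r
    df_error = n - k - 1
    return (r2 / k) / ((1.0 - r2) / df_error)

# Values reported in the text; four predictors are assumed in each equation.
f_males = f_from_multiple_r(0.67, 60, 4)    # df = 4 and 55
f_females = f_from_multiple_r(0.63, 67, 4)  # df = 4 and 62

print(round(f_males, 2), round(f_females, 2))
```

Both ratios come out near 10 to 11, well above the tabled .01 critical values for these degrees of freedom (which lie between 3 and 4), in line with the reported p < .01.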
Achievement Test Changes. In the part of the study relating to gains in mathematics achievement test scores, test and retest data on the mathematics pretest were obtained on 52 males and 63 females in the three mathematics courses. The partial correlation coefficients between Math Attitude Scale scores and retest scores on the mathematics pretest, partialing out the effects of initial scores on the latter, were .33 for males and .34 for females. Both of these coefficients are significant beyond the .02 level. As hypothesized, Math Attitude Scale scores predicted gains in scores on the mathematics pretest.
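The partial correlations for achievement test changes can be illustrated with a short sketch. It shows the standard first-order partial correlation formula and the t statistic usually used to test a partial r. The zero-order correlations are not reported in the article, so `partial_r` is included only to show the formula; the function names and the choice of test are assumptions, not the authors' stated procedure.

```python
import math

def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz**2) * (1 - r_yz**2))

def t_for_partial_r(r, n, n_controlled=1):
    """t statistic for a partial correlation; df = n - 2 - n_controlled."""
    df = n - 2 - n_controlled
    return r * math.sqrt(df / (1 - r * r)), df

# Reported partial correlations and sample sizes.
t_males, df_males = t_for_partial_r(0.33, 52)
t_females, df_females = t_for_partial_r(0.34, 63)

print(round(t_males, 2), df_males, round(t_females, 2), df_females)
```

Under these assumptions both t values exceed the two-tailed .02 critical value of roughly 2.4 for 49 and 60 degrees of freedom, which agrees with the statement that both coefficients are significant beyond the .02 level.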


Math Attitudes and Personality Meas- 
ures 

Temperament. As a test of the hy- 
pothesis of the unrelatedness of “gen- 
eral personality” (or temperament) 
variables to math attitudes, multiple 
regression analyses were carried out, 
including the seven part-scores of the 
MCI with the DAT Verbal Reasoning, 
Numerical Ability, and Abstract Rea- 
soning tests, and with a criterion of 
Math Attitude Scale scores. The larg- 
est portion of the variance, for both 
sexes, was accounted for by the regres- 
sion of DAT Numerical Ability on 
Math Attitude Scale scores (see Table 
2). However, MCI Leadership, for 
males, was significantly correlated with 
math attitudes (r = -.21, p < .05).
As Table 2 indicates, there is a slight 
suggestion that females with good 
“adjustment to reality” have more
positive feelings toward mathematics 
than those with poorer adjustment. 
For males, leadership qualities and 
positive math attitudes may be re- 
lated. In either case, the significant 
relation could be a chance result. 
Considering the general lack of cor- 
relation, it may be concluded that the 
hypothesis of unrelatedness of tem- 
perament measures and math attitudes 
is confirmed, though obviously not 
proved. 

Ability. Confirmation of the hypothesis that math attitudes are positively related to numerical ability is found in the significant partial correlations between DAT Numerical Ability and Math Attitude Scale scores when the effects of the other two DAT tests are partialed out. These coefficients are .23 for 87 females and .51 for 96 males in the three mathematics courses. A significant difference (p < .01) between the two coefficients suggests that for females individual differences in Verbal Reasoning and Abstract Reasoning make important contributions to the determination of attitudes toward mathematics.

TABLE 2
Analysis of Regression of Minnesota Counseling Inventory and Differential Aptitude Tests on Math Attitude Scale Scores

Source of Variation                                     df        MS         F
Males (N = 96)
  Regression due to 10 variables                        10      907.629     4.17**
  Regression due to DAT Numerical Ability                1     5482.898    25.19**
  Regression due to 9 variables
    (omitting Numerical Ability)                         9      399.265     1.83
  Regression due to 8 variables (omitting MCI
    Leadership and DAT Numerical Ability)                8      339.930     1.56
  Error                                                 85      217.675
Females (N = 87)
  Regression due to 10 variables                        10      855.098     3.70**
  Regression due to DAT Numerical Ability                1     4847.482    20.98**
  Regression due to 9 variables
    (omitting Numerical Ability)                         9      411.499     1.78
  Regression due to 8 variables (omitting MCI
    Adjustment to Reality and DAT Numerical Ability)     8      253.612     1.10
  Error                                                 76      231.025

** p < .01.
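The entries of Table 2 can be checked against one another. The sketch below assumes the numerical column holds mean squares, so that each F is the regression mean square divided by the error mean square, and that the 10-variable regression splits into the Numerical Ability term plus the remaining 9 variables. The data structure and names are illustrative only.

```python
# (df, mean square, printed F) for each regression line of Table 2.
table2 = {
    "males": {
        "error_ms": 217.675,
        "rows": [(10, 907.629, 4.17), (1, 5482.898, 25.19),
                 (9, 399.265, 1.83), (8, 339.930, 1.56)],
    },
    "females": {
        "error_ms": 231.025,
        "rows": [(10, 855.098, 3.70), (1, 4847.482, 20.98),
                 (9, 411.499, 1.78), (8, 253.612, 1.10)],
    },
}

def check_f_ratios(group):
    """Largest gap between a recomputed F (MS / error MS) and the printed F."""
    return max(abs(ms / group["error_ms"] - f) for _, ms, f in group["rows"])

def check_additivity(group):
    """Gap between SS for 10 variables and SS for 1 + 9 variables (SS = df * MS)."""
    (df10, ms10, _), (df1, ms1, _), (df9, ms9, _) = group["rows"][:3]
    return abs(df10 * ms10 - (df1 * ms1 + df9 * ms9))

for sex, group in table2.items():
    print(sex, round(check_f_ratios(group), 4), round(check_additivity(group), 3))
```

Under these assumptions every printed F is reproduced to within rounding, and the sums of squares add up, which supports reading the column as mean squares.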


Math Attitudes and Experience with 
Mathematics 


Ratings of Mathematics Teachers. The 
first portion of Table 3 presents evi- 


dence concerning the predicted positive 
relation between students’ ratings of 
their former mathematics teachers and 
their own math attitudes. Correlations 
are between scores on the rating scales 
of the IPDS, referring to remembered 
characteristics of teachers, and the 
Math Attitude Scale scores. The plus 
and minus signs in parentheses after 
the names of the variables in Table 3 
indicate high and low ends of the 
scales, respectively. Thus, the value 
of .34 for males, opposite the Patient 
vs. Impatient scale, evidences a
significant positive relation between 
positive attitudes toward mathematics 
and reported patience in previous 
mathematics teachers. It is noteworthy 
that more of the items are significant 
for females than for males, though 
none of the differences between the 
coefficients for males and females is 
statistically significant. The number of 
significant coefficients probably does 
not arise by chance. Math attitudes are thus apparently related to remembered impressions of teachers, female attitudes more clearly so than male attitudes.

Parental Encouragement. Correlations bearing on the hypothesized relations between math attitudes and reported parental encouragement of the study of mathematics and of study in general appear in the second section of Table 3. Since none of these coefficients is significant, there is no evidence that math attitudes are related to memory of parental encouragement of studying mathematics or academic subjects in general.

TABLE 3
Correlations between Math Attitude Scale Scores and Scores on Items Related to Student Impressions of Mathematics Teachers and Experiences with Parents

                                                      Correlation with Math Attitude Scale
Scale                                                   Males          Females
                                                       (N = 96)       (N = 87)
Your impressions of your math teachers:
  Patient (+) vs. Impatient (-)                          .34**          .26*
  Strict (-) vs. Lenient (+)                            -.06           -.10
  Hostile (-) vs. Friendly (+)                           .22*          [illegible]
  Fair (+) vs. Unfair (-)                                .14           [illegible]
  Demanded high standards (+) vs. Did not care (-)       .15           [illegible]
  Domineering (+) vs. Submissive (-)                     .05           -.05
  Full of fun (+) vs. Grim (-)                           .08           [illegible]
  [illegible] (-) vs. Kind (+)                           .07           [illegible]
  Clever (+) vs. Dull (-)                                .10           [illegible]
  [illegible] (-) vs. Controlled (+)                    -.04            .10
  Knew their subject well (+) vs. Were severely
    lacking in knowledge of subject (-)                  .13           [illegible]
  Really knew how to teach math (+) vs. Did not
    know anything about how to teach math (-)            .15           [illegible]
As you experienced your father when you were a child:
  Stressed my school work greatly (+) vs. Paid no
    attention to my school work (-)                     -.04           [illegible]
  Encouraged me to study math (+) vs. Discouraged
    me from studying math (-)                            .05            .11
As you experienced your mother when you were a child:
  Stressed my school work greatly (+) vs. Paid no
    attention to my school work (-)                     -.07            .00
  Encouraged me to study math (+) vs. Discouraged
    me from studying math (-)                           -.10            .14

* Significant beyond the .05 level.
** Significant beyond the .01 level.

Parental Attitudes toward Mathematics. To obtain data on the hypothesized relation between students’ math attitudes and their reports of their parents’ attitudes toward mathematics, Math Attitude Scale scores were correlated with the scores on the following scales of the IPDS:

As you experienced your father when you 
were a child: Liked math vs. Disliked math 


As you experienced your mother when you were a child: Liked math vs. Disliked math


The correlations between the Math 
Attitude Scale and the two variables 
above were .08 and .10, respectively, 
for the 96 males and .13 and .16 for 
the 87 females. Since these coefficients 
are not significantly different from 
zero, the hypothesized relation between 
students’ reported perceptions of the 
parents’ feelings toward mathematics 
and the students’ own attitudes toward 
the subject is not confirmed. 

Traumatic Experiences with Mathe- 
matics. The answers to the following 
questions on the IPDS were used to 
test the hypothesized relation between 
attitude and traumatic experiences 
with mathematics: 


Can you remember any specific embar- 
rassment or insecurity pertaining to your 
performance in arithmetic or math when 
you were a child? If so, describe. How old 
and in what grade were you when this took place?


A chi square test of independence 
between Math Attitude Scale scores, 
dichotomized at the median, and a 
Yes-No answer to the first question 
gave values of .62 and 1.11 for the 
96 males and 87 females, respectively. 
Since neither value is significant, the 
hypothesis of independence between 
the two variables cannot be rejected. 
There were not enough responses to 
the remaining items to make a statistical analysis.
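The chi-square values for traumatic experiences can be converted to probabilities without tables. For one degree of freedom (assumed here because a median split crossed with a Yes-No answer gives a 2 × 2 table), the upper-tail probability equals erfc(√(χ²/2)). The sketch below applies that identity to the two reported values; the function name is illustrative.

```python
import math

def chi2_p_df1(chi2):
    """Upper-tail probability of a chi-square value with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))

# Chi-square values reported for males and females.
p_values = {"males": chi2_p_df1(0.62), "females": chi2_p_df1(1.11)}

for label, p in p_values.items():
    print(label, round(p, 2))
```

Both probabilities are far above .05, consistent with the statement that neither value is significant.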


DISCUSSION


Assuming that the various measures 
employed for this study accurately 
assess their respective variables, we 
may conclude that the direct experience 
hypothesis which guided the investiga- 
tion overall is partly supported, but 
only partly. 

Math attitudes are apparently related to intellective factors and achievement, but not to temperament variables, at least within the limitations of this study. Experiences with former mathematics teachers are somewhat related to present math attitudes, although remembered parental encouragement, reported parental math attitudes, and students’ own remembered traumatic experiences are presumably unrelated to them.

The partly unrealistic nature of the 
assumption of validity of measures 
employed in this study is easily seen. 
Final grades, for example, are not 
entirely adequate to assess achieve- 
ment in mathematics. The group tem- 
perament inventory suffers from the 
well-known failings of such measures. 
Within the limitations of the measures, 
however, the conclusion of the influ- 
ence of direct experience upon math 
attitudes may be upheld. 

Other research will have to be under- 
taken to determine if factors not 
investigated here are operative in 
determining math attitudes. At best 
the proportion of variance of the 
variables here associated with math 
attitude variance is small. Other 
variables must be uncovered whose 
variance further accounts for that in 
the Math Attitude Scale. Perhaps a 
less direct method of ferreting out 
traumatic experiences with mathe- 
matics, for one instance, would reach 
some of the bases of mathemaphobia 
and other negative attitudes. Projec- 
tive measures or objective inventories 
other than those we have employed 
might reveal relations not found in 
this study. 


SUMMARY


The Math Attitude Scale, DAT Numerical Ability, Verbal Reasoning, and Abstract Reasoning tests, the Cooperative Mathematics Pretest for College Students, the Minnesota Counseling Inventory, and an adaptation of the Intensive Personal Data Sheet were administered to college freshmen in mathematics courses to determine in a limited way the etiology of math attitudes. Regression and correlation analyses of the intercorrelations of these measures and their relations with grades in high school and college mathematics courses supported to a modest extent the supposition that direct experiences in relation to mathematics contribute to math attitudes. Other influences in the determination of negative math attitudes are not excluded by the findings in this study.


REFERENCES 


Bennett, G. K., Seashore, H. G., & Wesman, A. G. Differential Aptitude Tests: Manual. New York: Psychological Corporation, 1952.

Berdie, R. F., & Layton, W. L. Minnesota Counseling Inventory: Manual. New York: Psychological Corporation, 1957.

Dreger, R. M., & Aiken, L. R. The identification of number anxiety in a college population. J. educ. Psychol., 1957, 48, 344-351.

Dutton, W. H. Measuring attitudes toward arithmetic. Elem. sch. J., 1954, 55, 24-31.

Edwards, A. L. Techniques of attitude scale construction. New York: Appleton-Century-Crofts, 1957.

Gough, M. F. Mathemaphobia: Causes and treatments. Clearing House, 1954, 28, 290-294.

Mathematical Association of America, Committee on Tests. Cooperative Mathematics Pretest for College Students: Manual. Princeton: Cooperative Test Service, 1947.

Poffenburger, T., & Norton, D. A. Factors determining attitudes toward arithmetic and mathematics. Arith. Teach., 1956, 3, 113-116.

Tulock, M. K. Emotional blocks in mathematics. Math. Teach., 1957, 50, 572-576.


(Received April 27, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 1, 25-29 


THE RELATION OF STUDENTS’ NEEDS TO THEIR 
PERCEPTIONS OF A COLLEGE ENVIRONMENT 
ANNE McFEE²


Syracuse University 


Over the past 3 years the College Characteristics Index (CCI) (Stern & Pace, 1958) has been filled out by several thousand students in more than a hundred colleges (Pace, 1960; Pace & Stern, 1958; Thistlethwaite, 1959). The instrument is intended to give an estimate of the press of the college environment. The 30 press scales in the CCI parallel the 30 needs scales in the Stern Activities Index (AI) (Stern, 1958). In using and interpreting the instrument, it is important to know whether the personality of the students who answer its items has any appreciable relationship to the way they answer them.
McConnell and Heist (1959), noting that the personality characteristics of student bodies vary widely from one college to another, and even between colleges which are highly selective in scholastic aptitude, have raised the question: “Do students make the college?” If one is to study the interaction between students and environments one must have independent estimates of each. The CCI should give an estimate of the environmental press independent of the personality needs of the students responding to it. Does it in fact do so? This study attempted to answer this question on two levels: the general relation between corresponding need and press measures, and the specific relation of each CCI item to a relevant personality need scale.

¹ This paper is based on the writer’s MA thesis in Psychology at Syracuse University. The author wishes to thank C. R. Pace for his extensive suggestions and guidance throughout the course of the study.
² Now at Stanford University.

Two other factors were studied. The 
first is the objectivity of the CCI 
item. The hypothesis here is that the 
more easily verifiable the behavior or 
knowledge the item describes, the less 
likely it is that people will see it in an 
individual way. The second factor 
considered is the likelihood that the 
student has a basis of personal ex- 
perience for saying that the item is 
“true” or “false” of his environment. 
The hypothesis is similar to the one 
relating to the objectivity of the item, 
i.e., the more familiar the students are
with the behavior described in the 
item, the more they will agree on its 
truth or falsity. When few students 
have experienced the behavior in 
question, there will be more disagree- 
ment. 

The respondent to the CCI is asked
to report whether certain specified 
behaviors or conditions are true of 
his college environment. It seems likely 
that this situation would tend to 
minimize unique, individual sets, and 
facilitate the expression of opinions 
that have been acquired by members 
of the college group through contact 
with the immediate environment. The 
perceptions of a particular group 
should show a high degree of agreement,
but there should be considerable
differences among groups. Preliminary 
study of the variance of scale scores 
within and between colleges on an 
earlier version of the CCI has shown 
this to be the case (Pace & Stern, 1958).




METHOD 


“Perception of the environment” is de- 
fined by the answers students give to the 
300 items of the CCI. Briefly, the index con- 
sists of 30 scales, with 10 true-false items 
in each scale. The items describe common- 
place activities or conditions which occur 
or might occur at college. The scales
describe 30 types of environmental press. A
press is an aspect of the environment which 
tends to encourage or reward a particular 
type of behavior. For example, if the item 
“Many upperclassmen play an active role 
in helping new students adjust to campus 
life” is answered “true” it is presumed to
reflect a press toward nurturant behavior. 

The personal characteristics of the stu- 
dent examined in this investigation are his 
scores in the AI. The AI also consists of 30 
scales of 10 true-false items each, but it is 
designed to measure level of need in the 
individual, rather than intensity of press 
in the environment. 

The items of the CCI were classified for 
objectivity and exposure value by three 
judges working together. Approximately 2 
weeks later, two of the judges went over 
the classifications. When there was doubt 
about an extreme item, it was shifted toward 
a middle category, in an effort to make 
extreme categories as free as possible from 
ambiguities. In this second session, only 
about 10% of the items were shifted.

For each item of the CCI, response frequencies were tabulated for people answering in the direction of the key, and in the opposite direction. In other words, for each item, there was a group of respondents who “passed” it, and another group who “failed” it. The mean AI score, on the scale corresponding to the CCI scale in which the item appeared, was calculated separately for the “pass” and “fail” groups. For each CCI item, t tests were computed between AI scale means for the “pass” and “fail” groups. For a sample of this size, the t’s would not be significant if the differences between the means were less than .50, so t ratios were calculated only for items showing differences of .50 or above. This minimum difference was necessary for significance no matter how the “pass” and “fail” groups were proportioned (97-3, or 50-50).
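The item-level comparison of "pass" and "fail" groups can be sketched as a two-sample t test computed from group summary statistics. The article does not say which form of the t test was used, so the pooled-variance version below is an assumption, and the means, standard deviations, and group sizes are hypothetical values chosen only to show the arithmetic; they are not data from the study.

```python
import math

def pooled_t(mean_pass, sd_pass, n_pass, mean_fail, sd_fail, n_fail):
    """Two-sample t with pooled variance, from group summary statistics."""
    df = n_pass + n_fail - 2
    pooled_var = ((n_pass - 1) * sd_pass**2 + (n_fail - 1) * sd_fail**2) / df
    se = math.sqrt(pooled_var * (1 / n_pass + 1 / n_fail))
    return (mean_pass - mean_fail) / se, df

# Hypothetical summary statistics for one CCI item (illustration only).
t, df = pooled_t(mean_pass=5.6, sd_pass=2.0, n_pass=60,
                 mean_fail=5.0, sd_fail=2.0, n_fail=40)

print(round(t, 2), df)
```

With 100 respondents the test always has 98 degrees of freedom, whatever the split between the two groups; only the standard error changes as the groups become more unequal.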

Subjects. Responses to both the CCI and 
the AI were obtained from 100 students in 
introductory psychology classes at Syracuse 
University. They are not necessarily a 
representative sample of Syracuse students; 


the study is not intended to provide a 
generalization about a particular univer- 


sity, but only about groups of college stu- 
dents. 


RESULTS


Perception Related to Personality Needs 


Pearson product-moment correlations were calculated between each pair of scales carrying the same label on the AI and CCI, e.g., between Need Achievement-Press Achievement. These correlations ranged from -.007 to .057. The median correlation was .006. Of the 30 correlations, 24 were in a positive direction, and 6 were negative.

For a sample of 100 cases, a Pearson r of .197 will be significant at the 5% level of confidence. None of these correlations is significant. Since on a chance basis alone occasional r’s of much larger magnitude than those obtained would be expected (one or two above .197), all computations were double checked. It is not likely that the lack of demonstrated relationship is due merely to low scale reliabilities. Reliabilities for the CCI and the AI have a mean of .65.
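The .197 threshold for 100 cases and the "one or two" chance correlations can both be reproduced with a few lines. The sketch assumes the usual relation between r and t, r = t / √(t² + df) with df = N − 2, and takes 1.984 as the tabled two-tailed .05 critical t for 98 degrees of freedom; that table value is an assumption supplied here, not a figure from the article.

```python
import math

def critical_r(n, t_crit):
    """Smallest |r| reaching significance, given the critical t for df = n - 2."""
    df = n - 2
    return t_crit / math.sqrt(t_crit**2 + df)

# 1.984: assumed tabled two-tailed .05 critical t for 98 degrees of freedom.
r_min = critical_r(100, 1.984)

# Number of the 30 scale correlations expected to pass .05 by chance alone.
expected_by_chance = 30 * 0.05

print(round(r_min, 3), expected_by_chance)
```

The result matches the .197 quoted in the text, and 30 tests at the .05 level give an expectation of 1.5 chance results, which is the "one or two" the author mentions.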


Response [ Jniformi 
ee mit: i 
Seeiity ty Related to Ob 


So posure Value of 
To test ¢ i : 
eden he hypothesis that easily 


lable items will be answered m 
consistently by the students, the 300 
“I items were classified as (a) high} 
objective, easily verifiable from obvi Be 
criteria; (b) somewhat objective, Vv i 
fiable from criteria requirin ells 
observer or otherwise less obvious. Phe 

; 


PERCEPTIONS OF A COLLEGE ENVIRONMENT 


i) 
~I 


TABLE 1 


VARIABILITY OF STUDENT RESPONSES AND RELATION oF STUDENT NEEDS 
to CCI Irems Varytnc IN PosstBiLity oF OBJECTIVE VERIFICATION 
AND IN ExposuRE VALUE 


Characteristic 


No. of Items 


% producing % producing % significantly 


Objectivity of Items: 


Highly Objective Items 62 
Somewhat Objective 107 
Subjective 131 

Exposure Value of Items: 
High Exposure 51 
Medium Exposure 119 
130 


Low Exposure 
2 


uniform divided related to 
responses responses needs* 
13 42 8 
5 65 8 
2.0* 2.3* 0 
24 41 6 
3 62 19 
4.2** 2.6* 2.2* 


—__ 


AON, 
- palate t's (.05 level) between AI scale means for 
aa ignificant at the .05 level. 

Significant at the .01 level. 


(c) subjective, confirmable only by 
asking more people the same question. 
Response frequencies were compared 
for the three groups. The results are 
shown in Table 1. 

There is a high degree of uniformity 
in response to an item if nearly everyone 
answers it the same way, whether 
this is in the direction of the key or in 
the opposite direction. Of the 62 items 
classified as highly objective, 13% had 
response percentages of 90 and above 
or 9 and below, compared to 5% of 
the 131 items classified as highly 
subjective. Thus the more objective 
items tend to evoke a somewhat larger 
percentage of highly uniform responses 
than do the subjective items. This difference 
is significant at the 5% level 
(z = 2.0). 
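
The comparison of 13% of 62 items with 5% of 131 items can be illustrated with a two-proportion z test. The paper does not state which formula was used, so the pooled-variance version below is an assumption, and the function name `pooled_two_proportion_z` is hypothetical. Because the published percentages are rounded, a recomputation of this kind need not reproduce every reported z exactly.

```python
import math


def pooled_two_proportion_z(p1, n1, p2, n2):
    """z statistic for the difference between two independent proportions.

    Uses the pooled proportion to estimate the standard error, which is one
    common textbook choice (assumed here, not confirmed by the source).
    """
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / standard_error


# Highly objective items (13% of 62) versus subjective items (5% of 131)
# producing highly uniform responses, as reported in the text.
z_uniform = pooled_two_proportion_z(0.13, 62, 0.05, 131)
print(f"z for uniform responses: {z_uniform:.2f}")
```

With these inputs the statistic falls close to the conventional 5% cutoff of 1.96, in line with the z of 2.0 reported in the text.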

Of the highly objective items, 42% 
fall in the middle range of response 
percentages (30 to 70), and 65% of the 
highly subjective items fall into this 
range. The middle range 
represents the responses that are the 




“pass” and “fail’’ groups on each CCI item. 


least uniform, since they center on 
the 50%, or completely divided, level. 
A smaller proportion of highly objective 
than highly subjective items falls 
in this range, i.e., produces disagreement 
among respondents. This difference 
is significant at the 5% level 
(z = 2.8). 

To examine the hypothesis that 
students will agree more in reporting 
behavior with which they are all 
likely to be familiar, the items were 
classified under three levels of probable 
familiarity or “exposure value”: high, 
medium, and low. 

Of the 51 items classified as high in 
exposure value, 24% had response 
percentages of 90 or above or 9 and 
below, compared to only 3% of the 
130 items classified as low in exposure 
value. The percentage of uniform 
responses to items of high exposure 
value was eight times the percentage of 
uniform responses to items of low 
exposure value. This difference is 
significant beyond the 1% level (z = 4.2). 


Of the high exposure items, 41% fell 
in the middle response range, and 62% 
of the low exposure items fell in this 
range. A smaller proportion of high 
exposure than low exposure items 
produced disagreement among respondents. 
This difference is significant 
at the 5% level (z = 2.6). In this 
respect, there is no difference between 
the objective-subjective and high 
exposure-low exposure types of classification. 
Both produce the same 
proportion of items in the middle 
response range. 


Relation of Responses to Personality 
Need, Objectivity, and Exposure 
Value of Item 

Of the 300 CCI items, 36 (12%) 
showed t ratios, significant at the .05 
level or above, between “pass” and 
“fail” group means on the corresponding 
AI scale. 

Of the items classed as highly objective, 
8% showed significant t ratios. 
Of the highly subjective items, again 
8% showed significant t ratios. The 
objectivity of an item does not seem 
to be related to the degree of influence 
of personality needs on student responses. 

Six percent of the high exposure 
items showed significant t ratios between 
“pass” and “fail” group means 
on the corresponding AI scale. Of the 
low exposure items, 19% showed 
significant t ratios. When a student, 
lacking direct experience, must guess 
at the answer to a question, whether 
objective or subjective in content, his 
own needs are more likely to influence 
his judgment to a significant degree, as 
reflected by these t ratios. The difference 
in proportions of significant 
t's between the high and low exposure 
items is significant at the 5% level 
(z = 2.2). 


ANNE McFEE 


SUMMARY 


This study attempted to examine 
some of the relations between students' 
perception of the college environment 
and various other factors. If the 
College Characteristics Index is to be 
useful to investigators as an objective 
indicator of differences between colleges, 
it should be independent 
of the personality needs of the 
informants filling it out. This 
study failed to find any correlation 
between scale scores of individuals on 
the CCI and their parallel scores on 
the AI, a personality test using parallel 
scale classifications; nor was a strong 
relation found between personality 
need and the students' perception of 
environmental press, as reflected in 
individual items. The responses to 88% 
of the 300 CCI items were independent 
of the parallel personality need of the 
respondent. 

Differences in objectivity of individual 
items produced a moderate 
difference in uniformity of response to 
the items, but produced no discernible 
differences in the influence of need on 
the item responses. Items about behavior 
or conditions which the student 
is unlikely to have encountered (i.e., 
those low in “exposure value”) produced 
much less agreement, and were 
much more influenced by need, than 
were items about more widely shared 
experience. 


REFERENCES 


McConnell, T. R., & Heist, P. Do students 
make the college? Coll. Univer., 
1959, 34, 442-459. 

Pace, C. R. Five college environments. 
Coll. Bd. Rev., 1960, 41, 24-28. 

Pace, C. R., & Stern, G. G. An approach 
to the measurement of psychological 
characteristics of college environments. 
J. educ. Psychol., 1958, 49, 269-277. 

Stern, G. G. Activities Index, Form 156. 
Syracuse, N. Y.: Psychological Research 
Center, Syracuse Univer., 1958. 

Stern, G. G. The ecology of the American 
college student: Varieties of constraint 
in American education. Syracuse, N. Y.: 
Psychological Research Center, Syracuse 
Univer., 1959. (Mimeo) 

Stern, G. G., & Pace, C. R. College Characteristics 
Index, Form 1158. Syracuse, 
N. Y.: Psychological Research Center, 
Syracuse Univer., 1958. 

Thistlethwaite, D. L. College press and 
student achievement. J. educ. Psychol., 
1959, 50, 183-191. 

(Received May 2, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 1, 30-34 


INHIBITION PHENOMENA IN FAST AND 
SLOW LEARNERS 


JAMES B. STROUD 
State University of Iowa 

AND 

LAMORE J. CARTER 
Grambling College 


The purpose of this study was to 
investigate intralist and interlist inhibition 
as a function of length of list 
and level of ability (rate of acquisition). 
Two groups of subjects, widely separated 
in rate of learning, were compared 
both on learning and recall of 
lists of two different lengths, in which 
all items by all subjects were learned 
to a common criterion of two and only 
two reinforcements. By definition, 
slow learners require more trials, more 
item presentations, than fast learners. 
They require more presentations to 
achieve the initial correct responses in 
the course of learning a list. It is also 
possible that the stability of correct 
responses, as determined by the probability 
of their occurring twice in 
succession, once they have occurred, 
may vary with ability level. On long 
lists, all subjects require more item 
presentations to achieve the initial 
correct responses than they do on 
short lists (Robinson & Darrow, 1924; 
Robinson & Heron, 1922). 

Also it is possible that the increasing 
of length would affect the stability of 
responses of all subjects, but that of 
slow learners more than of fast learners. 
Possibly fast learners behave on long 
lists much as do slow learners on short 
ones. One of the more important differences 
between the two ability groups 
may lie in a difference in their ability to 
withstand the effects of intralist inhibition. 

As stated thus far, the experiment 
was so planned as to: (a) investigate 
the relative effects of length upon 
trials required to learn by two widely 
separated ability groups, and (b) 
compare recall scores of the two groups 
of subjects on two different lengths of 
lists under the condition in which all 
individual items on both lists by all 
subjects were learned to a common 
operational criterion. 

As an attempt to further the purpose 
of this investigation, certain items just 
learned as a warm-up exercise were 
interspersed both in the short and the 
long list. It seemed worthwhile to 
know whether or not the retention of 
the warm-up items, when presented in 
a larger context, would vary with 
ability level and length of list.1 This 
amounts to what appears to be a 
procedure for investigating retroactive 
inhibition as a function of length of 
list and of ability level. 

The question of differences by ability 
level in susceptibility to retroactive 
inhibition seems rather important. In 
two recent investigations, no relationship 
was found between rate of acquisition 
and recall (Stroud & Schoer, 
1959; Underwood, 1954). If ability 
differences are associated with resistance 
to inhibition as just suggested and 
if this should turn out to be a general 
phenomenon, then slow learners should 
be more susceptible to retroactive 
inhibition effects than fast learners, 
and, in terms of widely accepted 
theory of forgetting, retain less well 
what they learn. Perhaps all this is a 
... experimental ... in the two investigations 
named. 


1 This procedure appears to be an efficient 
method of investigating both proactive and 
retroactive inhibition, particularly the 
latter since it very largely gets rid of the 
troublesome problem of rehearsal, the 


INHIBITION IN FAST AND SLOW LEARNERS 


PROCEDURE 


Two learning tasks were employed 
for purposes of selecting two groups 
of subjects, one fast and one slow. A list 
of 12 paired adjectives and one of 10 paired 
picture-names were used. The latter consisted 
of photographs (faces and shoulders) of 
male college students and fictitious first 
and last names. These were placed on film 
strips and projected upon a screen before 
classes of college sophomores. Throughout 
the first trial, a picture or an adjective, as a 
stimulus member, was exposed for 2 seconds; 
then both members of a pair were simultaneously 
exposed for 2 seconds. When all 
items of the list were thus exposed, there 
followed a recall trial in which the stimulus 
members were exposed, 4 seconds for 
the adjectives and 6 seconds for the pictures. 
During this time the subjects wrote in 
specified blank spaces such of the responses 
as they could recall. Two exposure and 
recall trials were allowed on the paired 
adjective list and three on the paired picture-name 
list. Total number of correct 
responses on the two lists combined constituted 
the scores. 

Two groups of 32 subjects 
each were selected, representing the top 
and bottom 5% of the group sampled. 

For the experiment proper, two lists of 
paired adjectives of average associative 
values from the Haagen 
(1943) list were used. Three separate lists 
of 12 items each were prepared. First one, 
then another of these lists was used as a 
12-item list, each being used an equal number 
of times for this purpose. The two remaining 
lists were combined to form a 24-item 
list which was learned as such. The 
three blocks of 12 items were systematically 
rotated throughout among the 12- and 24-item 
lists in an attempt to control possible 
differences in item difficulty between the 
long and short lists. 

The adjective pairs were inscribed on 4 X 
6 cards, a pair per card, appropriate 
for use in the Card Master. In the experiment, 
the first member of a given pair was 
exposed for 2 seconds, following which both 
members were simultaneously exposed for 
2 seconds. As learning proceeded, subjects 
were required to anticipate verbally the 
response member within the 2-second interval 
in which the stimulus member alone was 
exposed. A trial consisted of a single exposure, 
in this manner, of all the items in a 
list. The serial order of the cards was varied 
systematically throughout the learning 
trials. Each subject learned both the long 
and the short list, the order alternating 
from subject to subject. An interval of + 
seconds was interspersed between trials. 
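
The rotation of the three 12-item blocks among the short and long lists, as described above, can be sketched in a few lines. This is only an illustration of the counterbalancing idea. The block contents, the labels A, B, and C, and the function name `rotate_blocks` are hypothetical placeholders, and the sketch does not attempt to reproduce how the rotation was assigned to individual subjects.

```python
def rotate_blocks(blocks):
    """Return one design per block: that block alone is the short list,
    and the remaining blocks are combined to form the long list."""
    designs = []
    for index, short_block in enumerate(blocks):
        long_list = [
            pair
            for other_index, block in enumerate(blocks)
            if other_index != index
            for pair in block
        ]
        designs.append({"short": list(short_block), "long": long_list})
    return designs


# Placeholder item labels standing in for the paired adjectives.
blocks = [
    [f"{label}{position}" for position in range(1, 13)]
    for label in ("A", "B", "C")
]

designs = rotate_blocks(blocks)
for design in designs:
    print(len(design["short"]), len(design["long"]), design["short"][0])
```

Each block serves once as the 12-item list and twice as half of a 24-item list, so differences in item difficulty are spread evenly over the two list lengths.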

Each subject, just prior to undertaking 
each of his learning tasks, learned a warm- 
up list consisting of three pairs, by the 
method just described. Each such list was 
learned to a criterion of five errorless trials. 
For purposes stated presently, each of the 
three warm-up pairs, when learned, was 
placed in the appropriate list to be learned. 
Operationally this increased the length of 
each list by three items. On the short list 
the three warm-up items were placed in the 
fifth, tenth, and fifteenth positions; on 
the long list, at the ninth, eighteenth, and 
twenty-seventh positions. 

As a further condition of the experiment 
each item, except the warm-up items,2 was 
withdrawn from the list when and as it was 
correctly responded to twice, consecutively 
or not. This procedure was adopted for two 
reasons: it had the effect of shortening 
learning time; it insured the same number 
of operationally defined reinforcements on 
all items of both lists by all subjects. The 
latter seemed especially important for 
investigation of the effect of length and 
ability differences upon retention. 
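
The withdrawal procedure just described can be sketched as a small simulation. The sketch assumes a fixed probability of a correct anticipation on every presentation, which is a simplification introduced here and not a model proposed by the authors. It also ignores the warm-up items, which the experiment kept in the list. The function name `simulate_dropout_learning` and the parameter `p_correct` are hypothetical.

```python
import random


def simulate_dropout_learning(n_items, p_correct, criterion=2, seed=0,
                              max_trials=1000):
    """Simulate learning a list in which each item is withdrawn as soon as
    it has been answered correctly `criterion` times, consecutively or not.

    Returns the number of trials, the total item presentations, and the
    number of correct responses credited to each item.
    """
    rng = random.Random(seed)
    correct_counts = [0] * n_items
    remaining = list(range(n_items))
    trials = 0
    presentations = 0
    while remaining and trials < max_trials:
        trials += 1
        still_in_list = []
        for item in remaining:
            presentations += 1
            if rng.random() < p_correct:
                correct_counts[item] += 1
            # Keep the item only while it is short of the criterion.
            if correct_counts[item] < criterion:
                still_in_list.append(item)
        remaining = still_in_list
    return trials, presentations, correct_counts


trials, presentations, correct_counts = simulate_dropout_learning(12, 0.5)
print(trials, presentations, correct_counts)
```

Whatever the success rate, every item leaves the list with exactly the same number of correct responses, which is the property the authors wanted for comparing retention across list lengths and ability groups. Trials also become shorter as items drop out.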

Subjects appeared, individually, at ap- 
pointed times, learned the appropriate 
warm-up task and the appropriate main 
experimental task. After an interval of 48 
hours, they reappeared, engaged in a recall 
performance on the original experimental 
task, under the same conditions as those 
under which it was learned, and proceeded 
to learn the second warm-up task and the 
second main experimental task. For this 
task, a recall performance was exacted 48 
hours later. 


RESULTS 


The means of the number of trials 
required to learn the short and the 
long lists by the two ability groups are 
presented in A of Table 1. There is 
little room for doubt that the selection 
procedure used produced two groups of 
subjects far apart in learning ability. 
By the procedure of deleting items 
from the list as soon as they had been 
twice responded to successfully, trials 
became shorter as learning progressed. 
By number of trials is meant the 
number of times a list or some part of 
it was presented. 

2 The Card Master does not work well 
when fewer than four cards are in the machine. 


TABLE 1 
LEARNING AND RECALL DATA 

                                              Subjects 
Task Score                                 Fast      Slow 

A. Means of Trials to Learn 
     Short List                             9.16     36.00 
     Long List                             36.38    114.88 
B. Mean Number of Consecutive 
   Successful Anticipations 
     Short List                            11.34      9.78 
     Long List                             21.90     18.40 
C. Mean Number of Unsuccessful 
   Anticipations of Warm-up Items 
     Short List                              .22      2.13 
     Long List                               .44      2.66 
D. Means of Recall Scores 
     Short List                             5.38      5.13 
     Long List                             13.25     10.94 

Under this revised-list method, the 
increasing of the length of list resulted 
in the increasing of the number of 
trials required to learn and did so at 
an accelerating rate. This is in keeping 
with results on length obtained by the 
conventional method of learning the 
list as a whole to a criterion (Robinson 
& Darrow, 1924; Robinson & Heron, 
1922). 

The increasing of length resulted in 
a greater increase in number of trials 
required to learn by the slow learners 
than by the fast learners (78.88 vs. 
27.22). An analysis of variance procedure 
yielded a significant Length X 
Ability interaction (p < .005). Incidentally, 
the Length X Order interaction 
was also significant at a like 
level, suggesting that the practice 
effects from the long list upon the 
short list were greater than those of 
the short list upon the long one. 
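
The two increases cited above follow directly from the cell means in A of Table 1. The short sketch below recomputes them and the difference between them, which is the contrast underlying the Length X Ability interaction. Only the four means come from the paper. The variable names are illustrative, and the sketch does not reproduce the analysis of variance itself.

```python
# Mean trials to learn, from A of Table 1.
mean_trials = {
    "fast": {"short": 9.16, "long": 36.38},
    "slow": {"short": 36.00, "long": 114.88},
}

# Increase in mean trials when list length is doubled, per ability group.
increase = {
    group: round(means["long"] - means["short"], 2)
    for group, means in mean_trials.items()
}

# Difference between the two increases: the interaction contrast.
interaction_contrast = round(increase["slow"] - increase["fast"], 2)

print(increase)
print(interaction_contrast)
```

A contrast of zero would mean that lengthening the list cost both groups the same number of additional trials. The large positive value shows that the slow learners paid far more for the added length.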

The data were next analyzed with 
respect to the number of times correct 
responses once made occurred twice in 
succession, for the two ability groups 
and the two lengths. In this analysis 
the three interspersed warm-up items 
were counted, with the result that the 
maximum number of times two consecutive 
correct responses could have 
been made was 15 and 27, for the two 
lists. The results are presented in B, 
Table 1. 
On both lists, the number of consecutive 
successful responses made by 
the fast group was somewhat greater 
than that made by the slow group. 
This is consistent with Underwood's 
(1954) observation that the ... 
of a slow learner's response ... 
less to habit strength than that of the 
fast learner. However, the 
effect of list length upon the two groups 
was about the same. 
The mean number of unsuccessful 
anticipations of the three previously 
learned items (warm-up) interspersed 
in the lists is presented in C, Table 1, 
by ability level and by length of list. 
Ability differences were significant 
(p < .005). Ability X Length interaction 
was not significant. The differences 
associated with length were 
not significant. It seems clear that the 
fast learners could better withstand 
the interfering effects resulting from 
the presentation of the warm-up items 
within a context of similar items than 
could the slow learners. It may be that 
with longer warm-up lists, in the sense 
used here, significant length effects and 
Length X Ability interaction would 
have been obtained. 
Recall scores of the two groups of 
subjects on the two lengths were compared. 
D, Table 1, shows the mean 
number of items recalled (within a 2-second 
interval, 48 hours after 
learning) for these comparisons. 
Investigations of the relation between 
list length and recall have obtained 
higher recall scores for the longer lists 
when lists were learned to a common criterion 
(Robinson & Darrow, 1924; Robinson 
& Heron, 1922). This has generally 
been explained in terms of a relatively 
greater overlearning of some of the 
individual items in the longer lists than 
in the shorter ones. In this experiment, 
all items were learned to the same 
objective criterion. 
Long lists require more trials to 
learn, but when learned are retained 
as well as short lists learned in fewer 
trials. Consistent with this is the 
proposition that slow learners require 
more trials to learn lists of all lengths 
than do fast learners, but when learned 
retain them as well as do fast learners. 
The over-all ability effect was not 
significant. However, separate analysis 
showed a significant difference between 
the two ability groups on the long list. 
The Length X Ability interaction was 
significant (p = .05). 

The order effects on recall 
are worth mentioning in this connection. 
In all of the foregoing analyses, 
order effects (order in which the lists 
were learned) were determined. Generally, 
these were large and significant. 
Such order interactions as Order X 
Ability, Order X Length were significant. 
In recall, there were no order 
effects. 


DISCUSSION 


Probably no one doubts that there 
are differences among people in recall or 
in other measures of memory. In most 
practical life situations, differences in 
degree of learning, in familiarity, in 
subsequent utilization of learning, 
rehearsal, and others operate. These 
have an effect upon recall. There 
appears to be no compelling reason to 
posit some kind of differences inherent 
in basic psychological process in order 
to account for the observed differences. 
However, one possible basic difference 
in process does suggest itself. 


There is some evidence in the results 


of the present investigation that slow 
learners are more susceptible to inter- 
list interference than fast learners. 
This suggests that they may be more 
susceptible to the effects of retroactive 
inhibition, in the traditional sense. Our 
findings relative to ability differences 
in the recall of the warm-up items 
(C, Table 1), we think, support this. 
If various kinds of interfering effects do 
affect slow learners more adversely 
than fast ones, and do so generally, 
we should certainly expect recall ability 
to be related to learning ability. This 
assumption follows from the fact that 
retroactive inhibition or interference 
effects is our principal explanation of 
loss in ability to recall learned material. 

As already noted, some recent work 
(Stroud & Schoer, 1959; Underwood, 
1954) suggests that differences in recall 
may be unrelated to differences in 
learning ability. At least this was 
found to be the case in the experiments 
in question. At this stage it would 
seem unwise to generalize very far 
about this. In the experiments just 
referred to, subjects learned lists of 
nonsense syllables, paired adjectives, 
and picture-names, and presumably 
went about their normal business during 
the time between learning and recall. 
In these respects the materials and 
general procedure were not different 
from those employed by earlier workers 
who (by reason of the erroneous use of 
relearning as a measure of retention; 
Stroud & Schoer, 1959) concluded 
that retentive ability is positively 
related to learning ability. 

In the course of normal events, 
college students would encounter little 
material between learning and recall 
calculated to interfere with the recall 
of the learned material. At least inter- 
ference effects should be at a minimum. 
In any event, it would be interesting 
to ascertain whether or not differences 
in learning ability would be found to 
be associated with differences in recall, 
were appropriate interpolated learning 
experimentally introduced. Conceivably 
retention and learning ability may be 
related after all. 

There is, however, one disquieting 
thought: a list of nonsense syllables, 
learned in a psychological laboratory 
ought to be one of the best remembered 
things in the world. Hardly anywhere 
else would one encounter so few things 
within a 24- or 48-hour retention period 
to interfere with the recall of the 
learned material. Obviously, such 
material is forgotten at a rapid rate. 
Incidentally, the fact that conventional 
laboratory material is forgotten so 
readily under conditions which from 
the standpoint of interference should 
be highly favorable to its retention, 
suggests that there may be factors 
other than retroactive inhibition op- 
erating in forgetting. 

Loss in availability of response may 
in such cases contribute to the rela- 
tively rapid rate of forgetting, es- 
pecially when the subject has only a 
short interval in which to make a 
response. Perhaps loss of warm-up 
effects, or loss of appropriate mental 
set, which warm-up exercises may 
help to restore, are involved. Again, 
this phenomenon may be especially 
important in the recall of nonsense 
syllables or pairs of unrelated words, in 
comparison with the recall of meaningful 
material, where the subject is 
frequently able to make use of a rich 
complement of associative cues. 


SUMMARY 


Interlist and intralist inhibition as a 
function of ability level and length of 
list has been investigated. Two groups 
of subjects widely separated in learning 
ability learned and recalled (after 48 hours) 
two lists of paired adjectives of 12 
and 24 pairs. All items on both lists 
by all subjects were learned to a 
criterion of two and only two correct 
anticipations. Increasing list length 
resulted in a disproportional increase 
in number of trials required to learn, 
but in a proportional increase in the 
number of words recalled. Over-all, 
ability differences in recall were not 
significant. However, they were significant 
on the long list. The interspersing 
of previously learned items, referred 
to as warm-up items, within 
the two lists resulted in ability differences 
in loss of response. The 
results are discussed in relation to 
ability differences in recall. 


REFERENCES 


Haagen, C. H. Learning and retention as 
a function of the synonymity of original 
and interpolated tasks. Unpublished 
doctoral dissertation, State University 
of Iowa, 1943. 

Robinson, E. S., & Darrow, C. W. Effect 
of length of list upon memory of numbers. 
Amer. J. Psychol., 1924, 35, 235- 

Robinson, E. S., & Heron, W. T. Results 
of variation in length of memorized 
material. J. exp. Psychol., 1922, 5, 428- 

Stroud, J. B., & Schoer, L. Individual 
differences in memory. J. educ. Psychol., 
1959, 50, 285-292. 

Underwood, B. J. Speed of learning and 
amount retained: A consideration of 
methodology. Psychol. Bull., 1954, 
51, 276-282. 


(Received June 20, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 1, 35-43 


CONSISTENCY AND WISDOM OF VOCATIONAL 
PREFERENCE AS INDICES OF VOCATIONAL 
MATURITY IN THE NINTH GRADE 


DONALD E. SUPER 


Teachers College, Columbia University 


The practice of inquiring concerning 
the vocational preferences and 
plans of junior and senior high 
school students in helping them to 
make educational or prevocational 
choices and plans is widespread and 
rests on compelling arguments: it 
is unthinkable, in a democratic 
society, to require the determination 
of the directional choices which students 
make in selecting college preparatory, 
commercial, trade, home 
economics, agriculture, and general 
courses without taking into account 
the goals to which the youth aspires. 
Most school guidance questionnaires 
filled out by entering pupils ask 
what occupation they hope to enter, 
and numerous studies have been made 
of ways in which to elicit information 
on vocational preferences 
(..., 1952; Gilger, 1942; Hamburger, 
1958; Trow, 1941). 

The consistency, and particularly 
the wisdom or realism, of vocational 
preferences have often been used as 
measures of the effectiveness of vo- 
cational guidance programs by practic- 
ing counselors and by educational 
and psychological research workers 
(Froelich, 1949; Williamson & Bordin, 
1941). Having a vocational objective is 
important in a society in which earning 
a living is important, in which occupational 
roles are of major significance, 
and in which education is, in effect if 
not avowedly, occupationally oriented: 
having a vocational preference, in 
this context, gives purpose to behavior 
and makes possible educational and 
vocational decisions. It has been 
argued that consistency of vocational 
preferences shows intensity and validity 
of interest, and that it is better to work 
consistently toward one clear-cut goal 
than wastefully to keep shifting 
objectives. In the case of wisdom of 
vocational preferences, the reasoning is 
that realistic goals are by definition 
attainable, whereas unrealistic or 
unwise goals are by definition those 
which one is not likely to attain or 
with which one is not likely to be 
satisfied if one does attain them. 

Studies using consistency of vocational 
preferences are not very common, 
this being typically a counselor's 
method of judging how seriously to 
take a student's expressed preference, 
but the method is illustrated by 
Rothney’s recent Wisconsin Guidance 
Study (1958). 

Wisdom or realism of vocational 
preferences has been used as a criterion 
of the effectiveness of vocational guidance 
programs in many studies, e.g., 
Sparling (1933), Kefauver and Hand 
(1941), Stone (1949), Rothney and 
Roens (1950), Hoyt (1955), Rothney 
(1958), and Hewer (1959). Indices of 
realism are often used by counselors in 
judging students’ and clients’ need 
for guidance. 

Despite the widespread acceptance of 
the importance of consistency and 
wisdom of vocational preferences, 
many writers on vocational guidance 
and on vocational development have 
questioned the significance of expressed 
vocational preferences in early adoles- 
cence. Fryer’s (1931) review pointed 
clearly to the conclusion, confirmed by 
Carter’s later review (1944), that the 


expressed preferences of boys and 




girls in their early and middle teens 
are unstable. More recently still, 
Schmidt and Rothney (1955) reported 
convincing evidence on the instability 
of expressed vocational preferences 
from one year of high school to the 
next and into the first year out of 
school: only 49% of the “choices” of 
the tenth grade remained the same in 
eleventh grade, and this figure was 
reduced to 35% in the twelfth grade, 
and to 24% in the year following 
graduation. 

At the same time, it must be noted 
that expressed preferences do have 
practical significance when viewed from 
a certain vantage point and at certain 
ages. Dyer (1939) showed that, among 
college students, vocational preferences 
which had been constant over a period 
of years were related to subsequent 
occupational choice. Strong (1955) and 
McArthur and Stevens (1955) reported 
that, among students at Stanford and 
Harvard, expressed vocational preferences 
have considerable predictive 
value for adult occupation. As Dyer's 
study involved preferences which were 
constant over a long period of years 
(omitting all cases in which change of 
preference had taken place) and the 
others dealt with expressions of preference 
in late adolescence and early 
adulthood among students of superior 
intellectual, educational, and socioeconomic 
status, they do not conflict 
with the studies of more heterogeneous 
groups of early adolescents which have 
already been cited. 

In view of the persistent use of the 
concepts of consistency and wisdom 
of vocational preferences despite their 
tendency to be unstable in early adoles- 
cence, it is important to examine their 
psychological significance. In the Ca- 
reer Pattern Study of the Horace 
Mann-Lincoln Institute of School 
Experimentation several such measures 
were developed (Super & Overstreet, 




1960) and related to other variables in 
the ninth grade. 


METHOD 


Subjects 


The subjects of this study were the core 
group of the Career Pattern Study of the 
Horace Mann-Lincoln Institute of School 
Experimentation, 105 ninth grade boys who 
were found to be typical in age, intelligence, 
socioeconomic status, and other key variables 
of ninth grade boys in Middletown, 
New York, in the early 1950's. As Middletown 
itself is an average town on a variety 
of social and economic indices (Super & 
Overstreet, 1960), these boys may be considered 
typical of ninth graders in many 
American communities. All of the 
boys indicated, in their interviews, at least 
one tentative vocational preference. 


Measures of Consistency and Wisdom 
of Preferences 


Three measures of consistency of vocational 
preferences were developed, the third 
measure being essentially a combination of 
the first two. The expressions of vocational 
preference were obtained in tape recorded 
interviews in which the subjects of the 
study were asked “... about your plans 
... What would you like to be 
... thirty?” This question 
was followed with nondirective leads 
designed to keep the boy talking about his 
vocational preferences, and with probes 
to make sure that relevant information 
was obtained if not offered spontaneously 
(Super, Crites, Hummel, Moser, 
Overstreet, & Warnath, 1957). In the ... 
the first four vocational preferences 
... were tabulated. (It should be 
noted that these are measures of consistency 
of preferences at one point in time, 
not of consistency of first preference ...) 

Consistency within Fields was 
developed by classifying the expressed preferences 
according to occupational field ... 

Consistency within Levels was developed 
by classifying the preferences according to 
an occupational level scale, the score being the 
number of levels at which expressed 
preferences fell, minus 1. 

A third index, Consistency within Families, 
was developed by summing the scores 
on the above two indices, an occupational 
family being defined as a combination of 
field and level; technical occupations 
at the ... level constitute a family 
by this definition. 
study Al | wie asi are relevant to this 
viewed E ough no one of these can be 
icine ri : sufficient index of wisdom of 
which ‘is asi 38 of them involves a variable 
justifiable | ely accepted and objectively 
realism, as one measure or criterion of 
omnes index of wisdom or realism was 
eine aa between Ability and Prefer- 
between’ ie ne accurately, of agreement 
individual ie measured intelligence of the 
intio. of and the intelligence character- 
tion of ear employed in the occupa- 
on the at rst preference. If the boy’s score 

gee is Quick-Scoring Mental Ability 
exeedaat ihe into an AGCT equivalent, 
men in hi at of the bottom quarter of the 

By hie ssa is preferred occupation as shown 
Datiee arg for the Army General Classi- 

wa Shonen his ability was considered to 

teenies repent with the occupational intel- 
oF tang au remene if it was equal to that 

WHS ees the bottom quarter, his ability 

Henivarce” not to be in accord with the 

tides roe wisdom measure was the 

taining B greement between Measured Inter- 
by the § reference. Interests were measured 
slight! trong Vocational Interest Blank, 
ae the. modified to insure comprehension 
iin grade level, and each boy’s 
ory on were classified as primary, second- 
first tertiary in the family in which his 

meee ree fell, using Darley’s (1941) 

Prete he classifying interest score pat- 

eed hen the measured interest pattern 

the primary in the field corresponding to 

pst expressed preference, a score of 4 was 

the ; aid when the pattern was secondary 
joy was given a score of 3, etc. 
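
The two decision rules above, for ability and for measured interests, can be written out as a short sketch. The function and variable names are hypothetical. The scores of 4 and 3 come from the text; the values used for a tertiary pattern and for no pattern are assumptions that continue the series implied by "etc."

```python
def ability_agrees_with_preference(agct_equivalent, occupation_first_quartile):
    """True when the boy's AGCT-equivalent score exceeds the first quartile
    (the top of the bottom quarter) of men in his preferred occupation.
    A score equal to or below that point counts as disagreement."""
    return agct_equivalent > occupation_first_quartile


# 4 and 3 are stated in the text; 2 and 1 are assumed continuations.
INTEREST_PATTERN_SCORES = {
    "primary": 4,
    "secondary": 3,
    "tertiary": 2,
    "none": 1,
}


def interest_agreement_score(pattern):
    """Score the measured interest pattern in the field of first preference."""
    return INTEREST_PATTERN_SCORES[pattern]
```

A boy whose score exactly equals the first quartile of his preferred occupation is treated as not in agreement, following the "equal to" clause of the rule.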

The third wisdom index was that of
Agreement between Occupational Level of
Measured Interests and Level of Preference.
The socioeconomic level of the boy’s interests
(occupational interest level) was measured
by the Strong OL scale. If the boy’s
OL score was not more than one standard
deviation below the mean OL score of his
preferred occupation, as shown on Table 50
in Strong’s monograph (1943, p. 192), the
interest and preference levels were considered
to be in agreement.

The final wisdom measure to be discussed 
here is that of the Socioeconomic Accessi- 
bility of Preference. The family bread- 
winner’s occupation was rated according 
to the Hamburger revision (1958) of the 
occupational rating scale included in the 
Index of Status Characteristics (Warner, 
Meeker, & Eells, 1949). The boy’s vocational 
preference was rated on the same scale, the 
agreement or disagreement of the two rat- 
ings was ascertained, and the size of the 
discrepancy was the index of socioeconomic 


accessibility. 
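
The last two wisdom indices reduce to simple comparisons, sketched below. The names are hypothetical. Two points are assumptions made here and not statements from the text: that the standard deviation is supplied alongside the occupational mean, and that the accessibility index is the absolute (unsigned) size of the discrepancy.

```python
def interest_level_agrees(boy_ol, occupation_mean_ol, ol_standard_deviation):
    """True when the boy's OL score is not more than one standard
    deviation below the mean OL score of his preferred occupation."""
    return boy_ol >= occupation_mean_ol - ol_standard_deviation


def accessibility_discrepancy(parent_rating, preference_rating):
    """Size of the gap between the rating of the family breadwinner's
    occupation and the rating of the preferred occupation, both on the
    same occupational scale. Treating the gap as unsigned is an
    assumption of this sketch."""
    return abs(parent_rating - preference_rating)
```

With a mean of 50 and a standard deviation of 5, an OL score of 45 counts as agreement and a score of 44 does not.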
RESULTS 


It should be emphasized that no
one of these indices is considered
completely satisfactory, but that there are
good arguments in support of each of
them. For example, the occupational 
intelligence data of World War II are 
recognized as an imperfect sample of 
civilian occupations (Super, 1949), but 
they are the best available and have 
proved generally usable; furthermore, 
the use of the first quartile as the cut- 
ting point takes into account the great 
range of intelligence which makes 
possible success in any one occupation. 
In the case of socioeconomic accessi- 
bility, it may be objected that it is 
unwise, in a fluid and democratic soci- 
ety, to judge the realism of a vocational 


preference by its correspondence with 


the status of the parents; but even in 
our democratic society many studies 
show that the occupation entered by 
the child tends to be at the same 
socioeconomic level as that of the 
parent (Super, 1957), and even when 
the child changes socioeconomic levels 
he does it with the help or hindrance 
of his parents’ resources, contacts, 
information, and values. Each of 
these measures of the wisdom of a 
vocational preference is, therefore, one 
of the possible components of a con- 
ceptually more satisfying and valid 
global index of realism, in which 




having the appropriate intelligence
might, for example, offset not having
the socioeconomic background which
would be a help in achieving one’s
ambitions.


TABLE 1
INTERCORRELATIONS OF INDICES OF CONSISTENCY AND WISDOM OF NINTH GRADE
VOCATIONAL PREFERENCES

                                  Consistency                Wisdom
Index                        Field   Level   Family   Aptitude  Interest  Level
Consistency:
  Within Fields
  Within Levels              22*
  Within Families            62      Sls
Wisdom in terms of:
  Aptitudes                  16*     06      18*
  Measured Interests         -01     -12     -12      06
  Measured Interest Level    26**    04      14       00        -32
  Accessibility              08      13      18*      27**      -09       17*

Note. N = 105 boys. Decimal points have been omitted.
a Spuriously high because of use of one measure in the other.
* Significant at the .05 level, one-tailed test.
** Significant at the .01 level, one-tailed test.


Agreement among Indices of the Consistency
and Wisdom of Ninth
Grade Vocational Preferences


If consistency and wisdom of
vocational preferences are valid concepts
with which to work in dealing
with ninth graders, one should find at
least a moderate degree of agreement
(correlations of .30 to .50) between
various measures of consistency of
vocational preferences, a similar degree
of agreement among indices of the
wisdom of the vocational preference,
and some agreement (correlations of
.20 to .35) between consistency and
wisdom indices. Table 1 reports the
intercorrelations of seven CPS indices
of consistency and wisdom of vocational
preferences, for the 105 ninth
grade boys in the core group.

The indices of Consistency of Field
and of Consistency of Level are, of
course, highly correlated with the
index of Consistency of Families: this
happens because the last-named index
is a combination of the first two. Only
Field and Level in this group of three
measures are so constructed as to be
operationally independent of each
other, so that the correlation between
these two measures has meaning; it
is .22, significant at more than the .05
level but below the .01. This suggests
a slight tendency for the ninth grade
boys who are most consistent in aspiring
to occupations which are at the
same level to be the most consistent in
aspiring to occupations which are in
the same general field. The implication
of this very slight relationship is that
the concept of “consistency of vocational
preferences” has minimal meaning
when boys are in the ninth grade,
although it may take on significance
at a later age.
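
The statement that a correlation of .22 with N = 105 falls between the .05 and .01 levels can be checked roughly with Fisher's z transformation. The sketch below uses a normal approximation, which is an assumption made here; the article does not say which test was applied, and the function name is hypothetical.

```python
import math
from statistics import NormalDist


def one_tailed_p(r, n):
    """Approximate one-tailed p value for a positive correlation r
    observed in a sample of size n, using Fisher's z transformation:
    z = atanh(r) * sqrt(n - 3), referred to the standard normal curve."""
    z = math.atanh(r) * math.sqrt(n - 3)
    return 1.0 - NormalDist().cdf(z)
```

Under this approximation `one_tailed_p(0.22, 105)` lies between .01 and .05, and `one_tailed_p(0.26, 105)` lies below .01, which agrees with the asterisks attached to those two values in Table 1.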

The Consistency indices, Table 1
reveals, are not at all related to Wisdom
as measured by the index of Agreement
between Interests and Preferences,
and generally unrelated to other
Wisdom indices. Consistency of Field
and Consistency of Family (based
partly on Field) are related, to a low
and just barely significant degree, to
Agreement of Aptitudes and Preference,
and Consistency of Family is similarly
related to Accessibility of Preference.
The correlation of .26 between Consistency
of Field and Agreement between
Occupational Level of Measured
Interests and Level of Preference is
to be viewed as an artifact, as each of
these indices is in part based on other
measures which deal largely with
field or type of job, and
some types of jobs are largely at lower
levels (e.g., Business Contact, Outdoor)
while other types are largely at
higher levels (e.g., General Cultural),
and Level of Interests or
Occupational Level is by definition and
derivation a measure of socioeconomic
level. The relationships between
Consistency and Wisdom shown by
Table 1 must therefore be viewed as
negligible.

The internal agreement among
the indices of Wisdom of Vocational
Preferences must next be considered.
Agreement between Aptitude and
Preference is unrelated to Agreement
between Measured Interest and Preference
and to Agreement between
Level of Measured Interests and Level
of Preference; its correlation of .27
with Accessibility of Preference should
be viewed as an artifact in
view of the facts that the nonpreference
variables in the two Agreement measures
(intelligence and socioeconomic
status) correlate .27 in this sample,
and the preference variables (socioeconomic
level of preference and
intelligence level of preference) in the
two measures presumably have an
even higher intercorrelation because
in each case one variable (preference)
is scaled on one of two other highly
intercorrelated variables (socioeconomic
status and intelligence).
Agreement between Measured In- 
terests and Preference is unrelated to 
the other Wisdom indices, two of the 




intercorrelations being nonsignificant 
and the third (—.32) being the opposite 
direction from that hypothesized. And, 
finally, Agreement between Level of 
Measured Interests and Level of 
Preference is also to be viewed as 
unrelated to the other Wisdom in- 
dices, as the barely significant and 
very low correlation of .17 with Ac- 
cessibility can be attributed to an 
artifact: both components of both 
agreement measures are scaled as to 
socioeconomic level, and the socio- 
economic level of the vocational 
preference is one of the two measures 
entering into each of the indices. 

The only conclusion that it seems 
legitimate to draw from Table 1 is, 
then, that the few seemingly significant 
intercorrelations may be the products 
of artifacts, and that the various indices 
of consistency and wisdom of voca- 
tional preferences are unrelated to 
each other. This lack of relationships 
suggests a lack of validity as indices 
of anything significant in the vocational 
development of ninth grade boys. 


Agreement between Consistency and 
Wisdom of Preferences and Other 
Variables 

The construct validity of a set of 
measures of the same variable de- 
pends not only upon the intercorre- 
lations of these presumably similar 


measures, but also upon their agree- 
ment with other variables to which 
theory would lead one to expect them 
to be related. If consistency and wisdom
of vocational preferences are
considered to be indices of vocational 
maturity, of the degree of vocational 
development which has taken place in 
an adolescent, then they should be 
related to measures of other characteristics
which might be expected to
develop concomitantly with, to result 
in, or to be the product of, vocational 


maturity. 




TABLE 2
RELATIONSHIPS OF CONSISTENCY AND WISDOM OF NINTH GRADE VOCATIONAL
PREFERENCES TO STATUS AND BACKGROUND VARIABLES

                                   Socio-                         Pattern-
                                   economic  Intelli-  Achieve-   ing of     Adjust-  Peer
Index                        Age   Level     gence     ment       Interests  ment     Acceptance
Consistency:
  Within Fields              -11   -04       -04       09         ie         04       01
  Within Levels              -08   -03       06        00         -08        05       05
  Within Families            -12   -02       02        00         ei         03       01
Wisdom in terms of:
  Aptitudes                  -16   -07       12        03         -.97       -190     -12
  Measured Interests         =o    02        00        06         O9**       14       07
  Measured Interest Level    14    11        22        06         07         -06      -17
  Accessibility              12    42**      1         02         -09        -07      02

Note. N = 105 boys. Decimal points have been omitted.
* Significant at the .05 level, one-tailed test.
** Significant at the .01 level, one-tailed test.


The Career Pattern Study therefore
used or developed a series of measures
of other variables which could be
hypothesized as related to vocational
maturity. These were age (although
the negative correlation
between age and intelligence in any
one grade, in which the older pupils
tend to be the retarded and the younger
tend to be the accelerated, together with the
limited age range, may be expected
to confuse the relationship), socioeconomic
level, intelligence, school achievement,
patterning of vocational interests,
emotional adjustment, and peer
acceptance.

Socioeconomic level was measured 
by the placement of the family bread- 
winner’s occupation on the Hamburger 
revision of the Warner scale, intelligence 
by the Otis Quick-Scoring Test of 
Mental Ability, school achievement 
by grades in the three constant courses 
(taken by all students) of the ninth 
grade, patterning of interests by the 
application of Darley’s (1941) method 
to scores on Strong’s Vocational In- 
terest Blank, adjustment by Over- 
street’s method of deriving a total 
adjustment score from stories told in 
the Thematic Apperception Test (Su- 
per & Overstreet, 1960), and peer 


acceptance by a modification of the 
Guess Who technique (Super & Over- 
street, 1960). 

The correlations of the Consistency
and Wisdom indices with these variables
are reported in Table 2. Only 2
of the 49 correlations are statistically
significant and in the hypothesized
direction, and one of these relationships
is due to an artifact: Accessibility
is correlated .42 with Socioeconomic
Level, but the latter constitutes
one component of the former.
Agreement between Measured Interests
and Preference is related to Patterning
of Interests: boys whose preferences agree
with their measured interests tend to
have clear-cut patterns of measured
interests, a relationship which makes
excellent psychological sense despite
the lack of relationship of these indices
to other variables. (But when interests
are related only to interests one wonders
if there is perhaps a measurement
artifact at work here also.)

Five other correlations in Table 2
are large enough to be significant,
but since they are in the unexpected
direction and do not make psychological
sense they must be attributed to
chance. In a table of 49 correlations,
one would expect to find 2 or 3 correlations
which are statistically significant at
the .05 level, and perhaps one correlation
significant at the .01 level, strictly on a
chance basis. The conclusion to be drawn from
Table 2, as from Table 1, thus appears
to be that the consistency and wisdom
of vocational preferences have little
significance for prevocational and
vocational choices at the ninth grade
level.
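
The chance expectation cited for a table of 49 correlations follows from multiplying the number of tests by the significance level. The sketch below spells out that arithmetic; the function name is hypothetical, and the calculation assumes only that each test has probability alpha of reaching significance when no true relationship exists.

```python
def expected_chance_significant(n_correlations, alpha):
    """Expected number of correlations reaching significance by chance
    alone, when each test has probability `alpha` of doing so."""
    return n_correlations * alpha
```

For 49 correlations this gives about 2.45 at the .05 level, in line with the "2 or 3" mentioned above, and about 0.49 at the .01 level.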


Implications 


If the consistency and wisdom or
realism of a ninth grade boy’s expressed
vocational preferences lack
validity, as the Career Pattern Study
data suggest, does this mean that they
should be disregarded in practice? The
finding that consistency and wisdom of
vocational preferences have little
significance at Grade 9 does suggest that
they should not be used as criteria of
the need for guidance nor of the
effectiveness of guidance at that
stage of development. Presumably at
this age the very instability and
transitoriness of expressed vocational
preferences, and perhaps also their
inadequate factual basis, make their
apparent consistency and wisdom
largely a matter of chance. However,
one should be reluctant to draw the
conclusion that the preferences themselves,
even if inconsistent or unwise,
should be disregarded, for this would
involve not only the assumption that
the counselor knows better than the
pupil what is appropriate for him,
but also the assumption that the best
way to help the pupil to choose and
plan wisely is to get him to concentrate
on data coming from without
himself rather than to examine his
self-concept in relation to external

and impersonal data. The demonstrated,
even though far from perfect,
validity of aptitude, achievement,
and interest test data makes it clear
that the counselor does indeed have
unique externally derived information
as to what is appropriate for the pupil.
But it may well be, as many counselors 
believe and as some research shows, 
that the best way to let this informa- 
tion help the pupil is to aid him in 
assimilating it into his concept of 
himself. This is best done by beginning, 
not with the data, but with the self- 
concept. And the statement of a vo- 
cational preference is one way of 
expressing a self-concept, as Bordin 
(1943) and the present writer (1951) 
have pointed out. 

There are, of course, various ways 
of helping in the exploration of the 
self-concept as manifested in an ex- 
pressed vocational preference. One is 
to help the pupil to choose courses, 
activities, part-time employment, etc. 
in which he may find opportunities to 
see if the preferred role does indeed 
suit him, and to discuss his use and 
evaluation of these experiences with 
him as he participates in them. Another 
is to help him to examine his prefer- 
ence in the interview, relating it to 
his picture of himself as a student, as a 
part-time worker, as member of a 
family group, etc. In this discussion 
he may be helped to consider the views 
of him held by other persons and the 
picture of himself obtainable from 
records of his performance. This 
discussion can lead to the desirability 
of obtaining other such external data, 
and more testing or more participating 
in courses or activities may be planned 
and the results later reviewed by pupil
and counselor. By thus using expressed
vocational preferences as springboards
for exploration and growth, for
self-evaluation and further planning, 
the counselor proceeds democratically 


and with respect for his client’s right 
to and need for self-determination, 
while avoiding the error of placing 
undue emphasis on preferences as a 
basis for directional choice. 


SUMMARY 


Questions are raised concerning the 
significance of the consistency and 
wisdom or realism of vocational 
preferences among ninth grade boys, 
and the use of measures of these as 
criteria of the effectiveness of or the 
need for vocational guidance. The 
development and application of several 
such measures in the Career Pattern 
Study are described, and data on the 
construct validity of these indices for 
105 typical boys are reported. The
failure to find significant relationships
in the hypothesized directions is taken 
as evidence of the lack of psychological 
and hence of practical educational 
significance of consistency and wisdom 
or realism of vocational preferences at 
this stage of development. It is con- 
cluded that, although they may be 
meaningful at later stages, they should 
not be used as criteria of the need for 
or effectiveness of guidance and coun- 
seling at the ninth grade level. 


REFERENCES 


Beilin, H. Factors affecting occupational
choice in a lower socio-economic group.
Unpublished doctoral dissertation,
Teachers College, Columbia University,
1952.

Bordin, E. S. A theory of vocational interests
as dynamic phenomena. Educ.
psychol. Measmt., 1943, 3, 49-65.

Carter, H. D. Vocational interests and
job orientation. Appl. psychol. Monogr.,
1944, No. 2.

Darley, J. G. Clinical aspects and interpretation
of the Strong Vocational
Interest Blank. New York: Psychological
Corp., 1941.

Dyer, Dorothy T. Relation between vocational
interests of men in college and
their subsequent histories for ten years.
J. appl. Psychol., 1939, 23, 280-288.

Froehlich, C. P. Evaluating guidance procedures:
A review of the literature. Washington,
D. C.: Office of Education, 1939.

Fryer, D. The measurement of interests.
New York: Holt, 1931.

Gilger, G. A., Jr. Declaration of vocational
interest. Occupations, 1942, 20,
276-279.

Hamburger, M. Realism and consistency
in early adolescent aspirations and
expectations. Unpublished doctoral
dissertation, Teachers College, Columbia
University, 1958.

Hewer, Vivian H. Group counseling, individual
counseling, and a college class in
vocations. Personnel guid. J., 1959, 37,
660-665.

Hoyt, D. P. An evaluation of group and
individual programs in vocational
guidance. J. appl. Psychol., 1955, 39,
26-30.

Kefauver, G. N., & Hand, H. C. Appraising
guidance in secondary schools. New
York: Macmillan, 1941.

McArthur, C., & Stevens, Lucia B. The
validation of expressed interests as
compared with inventoried interests: A
fourteen-year follow-up. J. appl. Psychol.,
1955, 39, 184-189.

Moser, Helen P., Dubin, W., & Shelsky,
I. M. A proposed modification of the Roe
Occupational Classification. J. counsel.
Psychol., 1956, 3, 27-31.

Rothney, J. W. M. Guidance practices and
results. New York: Harper, 1958.

Rothney, J. W. M., & Roens, B. A. Guidance
of American youth. Cambridge,
Mass.: Harvard Univer. Press, 1950.

Schmidt, J. L., & Rothney, J. W. M. Variability
of vocational choices of high
school students. Personnel guid. J.,
1955, 34, 142-146.

Sparling, E. J. Do college students choose
vocations wisely? Teach. Coll. Contr.
Educ., 1933, No. 561.

Stone, C. H. Are vocational orientation
courses worth their salt? Educ. psychol.
Measmt., 1949, 8, 161-181.

Strong, E. K., Jr. Vocational interests of
men and women. Stanford: Stanford
Univer. Press, 1943.

Strong, E. K., Jr. Vocational interests 18
years after college. Minneapolis: Univer.
Minnesota Press, 1955.

Super, D. E. Appraising vocational fitness
by means of psychological tests. New
York: Harper, 1949.

Super, D. E. Vocational adjustment: Implementing
a self-concept. Occupations,
1951, 30, 88-92.

Super, D. E. The psychology of careers. New
York: Harper, 1957.

Super, D. E., Crites, J. O., Hummel, R. C.,
Moser, Helen P., Overstreet,
Phoebe L., & Warnath, C. F. Vocational
development: A framework for
research. New York: Bureau of Publications,
Teachers Coll., Columbia Univer.,
1957.

Super, D. E., & Overstreet, Phoebe L.
The vocational maturity of ninth-grade
boys. New York: Bureau of Publications,
Teachers Coll., Columbia Univer.,
1960.

Trow, W. D. Phantasy and vocational
choice. Occupations, 1941, 20, 89-93.

Warner, W. L., Meeker, Marchia, &
Eells, K. Social class in America.
Chicago: Science Research Associates,
1949.

Williamson, E. G., & Bordin, E. S. The
evaluation of vocational and educational
counseling. Educ. psychol.
Measmt., 1941, 1, 5-24.

(Received June 20, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 1, 44-49 


THE COMPARATIVE INFLUENCE OF PUNITIVE AND
NONPUNITIVE TEACHERS UPON CHILDREN’S
CONCEPTS OF SCHOOL MISCONDUCT¹


JACOB S. KOUNIN AND PAUL V. GUMP


Wayne State University 


This paper reports a portion of a
research project pertaining to the management
of children’s behavior in
classroom settings. Because so many
teachers, especially beginners, verbalize
considerable concern about discipline
and control, we are focusing our
current research in this area. While
there is some relevant literature, such
as that of Sheviakov and Redl (1944),
based upon experience and insightfulness,
we have been unable to locate
any generalizations based upon data
from research.

¹ A version of this paper was presented
at the American Psychological Association
meeting, September 5, 1959. It is part of an
investigation supported in part by a Research
Grant M-1066, from the National
Institute of Mental Health, United States
Public Health Service and in part by a
grant from the College of Education of
Wayne State University.

In a previous study by Kounin and
Gump (1958) specimen-record
observations were gathered of discipline
incidents during the first week
of kindergarten, focusing upon the
triad of: a misbehaving child (target),
a teacher doing something to stop the
misbehavior, and a watching audience-child.
Limiting our dependent variables
to overt behavior we found that teachers’
techniques of handling a misbehaving
kindergarten child (target) did
have different degrees of socializing
success upon audience-children. A socializing
success was defined as an
observable reduction of overt misbehavior
or an increase in conforming behavior
(standing up “even straighter”
in line). Control techniques high in
clarity (defining the deviancy, specifying
how to stop) were most successful.
Control techniques high in firmness
(standing closer to the misbehaving
child, continuing to look at him until
he stopped misbehaving) were successful
only for audience-children who were
themselves deviancy-oriented at the
time. Control techniques high in roughness
(anger, physical handling) were
least successful and tended to be followed
by behavior disruption (less
involvement in work, overt signs of
anxiety) rather than conformity on
the part of audience-children. In terms
of their effects, it is evident that roughness
is a different dimension than firmness.
ae attitudes toward misconduct 
Pr tas pe = affected by differences i? 
these as al ee mre decided to Btuey 
+ 4n an unpublished study 


at camp, P. G B 
J. Kounin fo ae 


tiveness, goal-di 
-directed: 
wie Psychological Sbdentats 
hers. The campers’ attitud sm, 
misconduct also varieq , —¢S toward 


whether they were talkin ®Ccording +o 


home, or school mj): 0 
1 milicus ut camp, 
bon soc to limit the 1 . © decided, 
Slon to puniti ershi ir 
It is postulated that : 


fo counterseression:; it igetession leads 

Puniti 

power over her pupil, cher ce oe 
mor! 

nt) 


than they have
over her and that she blocks overt
manifestation of pupils’ aggression.
Observations in the classrooms of the
punitive teachers selected for this
study indicate that this second assumption
is warranted. From these two postulates
we derive the following hypotheses:

1. That the school misconduct preoccupations
of children with punitive
teachers will contain more aggression
than those of children with nonpunitive
teachers.

2. That children with punitive
teachers will be more conflicted about
school misconduct than will children
with nonpunitive teachers.

3. That the aggression needs and
Botha ct relating to misconduct hy- 
ith sized to exist among children 
ies Eee teachers will detract 
Welues salle with school-unique 
liga not directly related to 
Pi aga question may also be raised 
ea ether or not the amount of 
ites generated in the children with 
Pet lmend punitive teachers is 
oneteies ly great to reduce the rational 
cnt es of their attitudes toward mis- 
uct. 


METHOD


Subjects. The subjects were 74 boys and
100 girls attending their first semester of
first grade in the public schools of a
city. They represented all the children
from six home rooms of three schools, in
upper-lower to middle-middle socioeconomic
neighborhoods.

Procedure. Overall school climate was
controlled by selecting pairs of punitive
vs. nonpunitive teachers from the same
school. Three such pairs were obtained from
three elementary schools.

The initial selection of punitive and nonpunitive
teachers was obtained from the
principal and assistant principal. Following
this the classes were observed by both
principal investigators. Approximately a
week later the teachers were further rated
by a supervisor of student teachers who
visited each class twice.




The raters checked along a continuum 
from Extremely Punitive (threatens chil- 
dren with consequences that really hurt; 
makes threats that imply sharp dislike, real 
willingness to harm child; ever-readiness to 
punish) to Not Punitive (does not punish 
and does not threaten). A punitive vs. non- 
punitive pair of teachers was used for the 
study only when all five persons agreed on 
their dichotomizations. All the teachers 
were rated as having good organization, 
well-behaved classes, and as achieving the 
learning objectives for their grade. Eighty-four
of the children were in classes with
punitive teachers and 90 children were in
classes with nonpunitive teachers.

The children were interviewed individually
during the third month of attendance
at school. The interview consisted of the 
questions: “What is the worst thing a child 
can do at school?” and, following the reply, 
“Why is that so bad?” Identical questions 
were asked regarding home as the milieu 
for misbehavior. 

Coding the Replies. The misconducts 
mentioned by the children were coded for 
content and for certain qualities or dimen- 


sions. 

The content code (obtained from the 
question of “What is the worst thing to 
do?’’) contained two parts: the misconducts 
and the explanations given for why these 
were bad. The misconduct included: the 
act type (physical or psychological assaults, 
noncompliance, etc.) and the object of the 
misconduct (parents or teachers, other chil- 
dren, institutional laws or custom, etc.). 

The code for the explanation of miscon- 
duct was designed to answer three ques- 
tions: Who is involved in the consequence 
(the child himself, parent or teacher, a peer, 
etc.)? What kind of sufferings result to 
others from the misconduct (physical pain, 
achievement loss, property loss, etc.)? What 
kinds of retributions occur to the misbe- 
haver (work imposal, character loss, physi- 


cal punishment, etc.)? 

RESULTS AND CONCLUSIONS


Children probably answer the ques- 
tion of “What's the worst thing a 
child can do in school?” with a report 
of acts that reflect their preoccupa- 
tions. It is not likely that our subjects’
answers would have been the same if
they were presented with a forced-choice
of alternative acts. Given a
choice, most children would probably
rate “stabbing someone” as more serious
than “talking in class.” If the
misconducts the children talked about 
are taken to represent tension systems 
and preoccupations, we may infer 
from these the comparative impact of 
punitive and nonpunitive teachers. 

In a concurrent study of children’s
attitudes toward misconduct (201 boys
and 214 girls in the first grade of six 
public schools representing a range of 
socioeconomic backgrounds), Gump 
and Kounin (1959) found both sex 
differences and differences between 
home and school milieus. For example, 
home misconducts included more 
breaking of objects while school mis- 
conducts included more rule violations; 
parents suffered more than teachers 
in consequences but teachers retributed 
more frequently. However, parents 
were reported as retributing with more 
corporal punishment and with more 
severe punishment than teachers, 
There were also differences in the 
responses of boys and girls, especially 
in school. For example, girls reported 
“talking” as a school misconduct eight 
times more frequently than boys 
whereas boys reported physical assaults
on peers in school more frequently
than did girls.

Consequently, the comparison of 
the responses of children with punitive 
and nonpunitive teachers was made 
separately for sexes and also for home 
and school milieus. However, on all 
comparisons of school responses the 
direction of differences between chil- 
dren with punitive and nonpunitive 
teachers was the same for both boys 
and girls. The report of results, there- 
fore, combines both boys and girls. 
Insofar as the differences between chil- 
dren with punitive and nonpunitive 
teachers are concerned, only 2 of the 
48 comparisons of home responses were 
statistically significant: home misconducts
of the children with punitive
teachers were rated as more serious
(p < .05, for girls only) and retributions
to the subject were more serious
(p < .02, for boys only). It is uncertain
whether these represent some spillover
of the influence of punitive teachers
onto attitudes toward home misconducts,
or whether they are chance differences
for the number of comparisons
made.
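
The question of whether 2 significant results out of 48 comparisons could be chance differences can be examined with a binomial calculation. The sketch below is a rough check only: it assumes the 48 comparisons are independent and that each has a .05 probability of reaching significance by chance, neither of which is stated in the text, and the function name is hypothetical.

```python
from math import comb


def prob_at_least(k, n, alpha):
    """Probability of k or more chance-significant results among n
    independent tests, each significant with probability alpha."""
    below_k = sum(
        comb(n, i) * alpha**i * (1 - alpha) ** (n - i) for i in range(k)
    )
    return 1.0 - below_k
```

Under these assumptions `prob_at_least(2, 48, 0.05)` is roughly .70, so two significant home comparisons would be unremarkable on a chance basis.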

The results to be reported here, then,
refer to boys and girls combined and
to school misconducts only. Intercoder
reliabilities ranged from 73-95%
agreement, with a median of 90. The
p levels of differences are based on
the χ² test. In the case of dimensions,
such as “seriousness,” the results
were dichotomized into a High and a
Low based upon as equal a break as
possible and resulting χ²s were
based upon 2 × 2 tables. In the case
‘ ch as act- allin 
into the categories of: re ahem 
physical assaults on children, property 


damages, o: 
? r nonco fs 
adults, the x2 nformance with 


A. Physi 
Ysical assaults include 
1 ~ 
S on other persons pas ra physical 


Town at 
Y length 


© Seneral 




immorality or danger in the misconduct 
considered from the point of view of the 
overall culture. (The only frame of reference 
which produced high intercoder agreement 
was when the coders took the position of an 
understanding Reformed rabbi or a Uni- 
tarian minister. Taking an unspecified role, 
or that of either a parent or a teacher pro- 
duced low intercoder agreement.) “Studying
spelling lessons at the wrong time” is
morally trivial while “maiming someone”
is morally very serious.

D. Abstractness of misconduct refers to
the size of coverage. It may range from a
unique, “one time” misbehavior, such as,
“cut a climbing rope in gym,” to an abstract
one, such as, “be mean to other people.”


II.

A. A central adult is the responsible
leader: teacher at school.

B. A psychological loss to another is
exemplified by “It would make her worry.”
. C. Seriousness of consequences to others 

ange from trivial harm, such as, ‘‘She’d 
ae to serious ones, such as, “‘He’d 
me reality-centered retribution (this is 
aie only when the perpetrator himself 
the rs in the consequence) is coded when 
Dati ee of a misconduct follows 
sie y from the act-type, such as: “not 
oe oemee you'll get behind in your 
which This contrasts with the response in 
= ich the connection between act and con- 
erence, is dependent upon a personal 
i ervention of another, such as: “not study 
because teacher will make you stand in the 
corner,”’ 

E. "Reflexive justification" was coded when the child gave no consequence for either himself or others in his explanation of why the act was bad. When he said the act is bad because "It's not nice" or "It's bad" it was called a reflexive justification.


III.

A. On ego-acceptability, we sought to determine the degree to which the respondent could see himself as the perpetrator of the misconduct. In an ego-alien act, the respondent expresses abhorrence, such as: "It's dirty to hit little kids who didn't do nothing to you." An ego-attractive act is one in which the child indicates its seductive quality for him, such as: "Tell off a teacher, boy, I'd like to do that."

B. On the premeditation category, we sought to learn the extent to which the child sought to do wrong. If premeditated, the child plans the act and intends the consequences ahead of time, such as: "Put thumb tacks on teacher's chair when she is out." If intentional, the child accepts his part in the wrongdoing but does not plan it, such as, "talk during a lesson."


TABLE 1
A COMPARISON OF ATTITUDES TOWARD SCHOOL MISCONDUCTS HELD BY CHILDREN
WITH PUNITIVE AND NONPUNITIVE FIRST GRADE TEACHERS
(N = 176)

Misconducts and Explanations                         Pu (%)(a)   NPu (%)

I. Content and quality of the misconducts:
   A. Physical assaults on others                       38          17
   B. Milieu-serious misconducts                        89          63
   C. Coder-serious misconducts                         48          27
   D. Abstract misconducts                              27          52
II. Content and quality of the explanations:
   A. Peers as objects of consequences                  94          61
   B. Physical damage to objects of consequences        60          23
   C. Serious harm to others                            45          18
   D. Reality-centered retributions                     21          48
   E. "Reflexive justifications" as explanations        11          26
III. Role of self in misconducts:
   A. Ego-alien misconducts                             26          11
   B. Premeditated misconducts                          29          15
IV. Aggression:
   A. Overall aggression ("blood and guts")             49          24
V. Concern with school-unique objectives:
   A. Learning and achievement losses                   20          43
   B. Institutional law violations                      49          62

Note. All differences in percentages are significant at the .05 level or beyond.
(a) Pu stands for those children who have punitive teachers; NPu refers to those children who have nonpunitive teachers.


IV.

Aggression ("blood and guts") refers to the amount of aggression the respondent expresses in his misconducts and consequences. "Play in the storage bin because somebody might get hurt" expresses less aggression than "Play in the storage bin because you might push a kid off and there could be a sharp rock down there and he could hit his head against it and crash open his skull and he would bleed and his brains would fall out and he'd die."


V. 

A. A learning or achievement loss is coded 
when interference with learning is the mis- 
conduct or the explanation, such as: “It’s 
bad to make noise because somebody could 
make a mistake in his work,” or “. .. be- 
cause then he couldn’t read good.” 

B. An institutional law violation is a 
violation of the rules of the school such as: 
“talk when you’re supposed to study,” 
“not take your seat when the bell rings.” 


The results presented in Table 1 may be summarized around the three hypotheses and the one question raised in the introduction:

Punitive teachers will create or activate more aggression-tension than will nonpunitive teachers. This is strongly supported by the data. The children who have punitive teachers have more sheer aggression in their sins and consequences, they give both more milieu-serious and more coder-serious misconducts, their targets suffer more harm, they give more physical assaults as act-types, and their targets suffer more physical harm. The targets of children with nonpunitive teachers are more inclined to suffer psychological losses as consequences. As an example of the results: of 84 respondents with punitive teachers, 31 give physical assaults on other children and 40 mention school rule violations; while of 90 children with nonpunitive teachers, 15 talk about physical assaults and 56 about rule violations. (The remainder of the act-types are nonconformances and "miscellaneous.")
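The counts quoted in this example can be turned into the kind of 2 × 2 χ² comparison on which the p levels are said to be based. The sketch below is illustrative only: it assumes an uncorrected Pearson χ² with 1 degree of freedom (the text does not say whether a continuity correction was applied), and the function name `chi_square_2x2` is hypothetical.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Counts quoted in the text: physical assaults are named by 31 of 84 children
# with punitive teachers and by 15 of 90 children with nonpunitive teachers.
pu_yes, pu_total = 31, 84
npu_yes, npu_total = 15, 90

chi2 = chi_square_2x2(pu_yes, pu_total - pu_yes, npu_yes, npu_total - npu_yes)
CRITICAL_05_DF1 = 3.841  # tabled chi-square value for p = .05 with 1 df

print(f"chi-square = {chi2:.2f}")
print("significant at .05:", chi2 > CRITICAL_05_DF1)
```

Under these assumptions the statistic comes out near 9.15, well beyond the .05 criterion, which agrees with the direction of the difference reported for physical assaults in Table 1.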

Children with punitive teachers will be more unsettled and conflicted about misbehavior in school. This hypothesis is supported by the findings related to the role of self in misbehavior. The children from nonpunitive teachers give misconducts in which their own role is intentional whereas children from punitive teachers give both premeditated and ego-alien misconducts. We may say that children with punitive teachers express more abhorrence for the misdeeds which they have selected and yet select misdeeds which require "malice and forethought."

Punitiveness of teachers will detract from children's concern with school-unique values. This hypothesis is supported. Children from punitive teachers talk more about physical attacks on peers, misbehavior by no means unique to the classroom setting. Children with nonpunitive teachers talk more about learning, achievement losses, and violations of school-unique values and rules.

Do children from nonpunitive teachers show more rational qualities in their responses? The answer to this question is not clear. Fairly direct attempts to measure this (codes for milieu likelihood [...], for likelihood of consequences, or appropriateness of consequences to the misconduct) did not show significant differences. But these [...] punitive teach[ers] [...] as indications of [...] conflicted state of the [...] misconduct held [...] punitive teachers, [...] inclined to misbehave [...] a concrete act occurs to him, "hit George in the mouth"; when he is not [...] by his needs to misbehave, then an abstraction occurs to him, "be mean to people." Similarly, when he expresses this verbal act, a real consequence occurs (the target gets [hurt] or the perpetrator suffers a consequence); when he is not preoccupied with wrongdoing then a reflexive justification occurs to him, "it's not nice." Reflexive justification at this age [may not] be a primitive reply but a [reflection] of a settled issue: "You just don't do this because it's not nice."

[Another] interpretation is to regard [the greater] use of reflexive justification [by the] children with nonpunitive teachers as evidence of their greater [trust] and faith in school, i.e., of their [acceptance] of school values more [than] children with punitive teachers. [An analysis] of the data showed the reflexive justification was used predominantly in connection with rule violations (talking, running in halls, [...] seat, and the like). These misconducts are milieu-inconvenient [acts] which are disturbing to the milieu but [which] are without direct harm to either the actor or to others and do not [violate an] important moral code. Such misconducts to the first grade child [have no] real explanation except that "they're bad because they say so." [...] they express a sort of naive [faith] and trust in the rightness of what the teacher says.


SUMMARY 


Three pairs of punitive vs. nonpuni- 
tive first grade teachers were selected 
from three elementary schools. The 
174 children in these teachers’ class- 
rooms were individually interviewed 
about what they thought was “the 
worst thing to do in school” together 
with their explanations of why these 
misconducts were bad. Regarding their 
responses as expressions of their pre- 
occupations it was concluded that, as 
compared with children who have non- 
punitive teachers, children who have 
punitive teachers: manifest more ag- 
gression in their misconducts, are more 
unsettled and conflicted about mis- 
conduct in school, are less concerned 
with learning and school-unique values, 
show some, but not consistent, indica- 
tion of a reduction in rationality per- 
taining to school misconduct. A theory 
that children with punitive teachers 
develop less trust of school than do 
children with nonpunitive teachers 
was also presented to explain some of 
the findings. 

REFERENCES 
Gump, P. V., & Kounin, J. S. Sex and milieu influences in children's concepts of misconduct. Amer. Psychologist, 1959, 14, [...].

Kounin, J. S., & Gump, P. V. The ripple effect in discipline. Elem. sch. J., 1958;

Sheviakov, G., & Redl, F. Discipline for today's children and youth. Washington, D. C.: National Education Association, 1944.
(Received June 29, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 1, 50-60 


A LONGITUDINAL STUDY OF THE PRIMARY 
MENTAL ABILITIES TEST 


WILLIAM J. MEYER AND A. W. BENDIG
University of Pittsburgh 


A major criticism of research con- 
cerned with developmental changes in 
the structure of intelligence has been 
the lack, at each age level, of com- 
parable test batteries and subjects 
(Anastasi, 1948). It can be argued that 
inappropriate test content at one age 
level or another serves to lower relia- 
bility which in itself could account for 
age changes. Differences in variability 
of scores can similarly affect the usual 
statistical procedures employed in 
these kinds of studies. Accordingly, 
longitudinal data would seem to be 
one means of reducing the effects of 
these variables. The writers have for- 
tunately been able to employ a sample 
of 100 subjects who were examined on 
two occasions separated by a time 
interval of approximately 3.5 years 
with a test standardized for both de- 
velopmental levels. The purpose of 
this paper is to present several analy- 
ses of the data appropriate to each of 
the following issues: developmental 
changes in the magnitude of g, consistency
of relative position and consistency
of profiles, consistency of
factorial structure (factor validity), de- 
velopmental changes in magnitude of 
sex differences and rates of growth on 
each of the primary abilities, and ac- 
curacy of long range predictions of 
achievement. 


METHOD


The Primary Mental Abilities Test (PMA), Intermediate Form (age 11-17), was administered to all of the eighth graders present in a junior high school located in a small industrial community.¹ Approximately 3.5 years later, during the last week of Grade 11, the same form of the test was readministered, at which time only 8% of the original sample were, for a variety of reasons, not in attendance. As shown in Table 1, the final sample of 100 youngsters, 49 boys and 51 girls, scored somewhat above average on the test (M_IQ = 107.95) with the variability comparing favorably with the standardization group (σ = 16.56). As a safeguard against selective factors influencing the data, only those subjects available at both testing sessions were included in the final analysis. An important feature of this sample is the fact that the subjects are homogeneous with respect to socioeconomic background (lower middle class) and education. Most of the boys were in the general education program whereas most of the girls were enrolled in a commercial program.

The PMA yields scores on the five presumably independent traits which Thurstone (1938) has defined as constituting intelligence. These five traits include Verbal Meaning (V), Space (S), Reasoning (R), Numerical (N), and Word Fluency (W). In addition a Total score (T) is available. The reliabilities of each of the subtests as well as the Total score are quite satisfactory (Thurstone, 1958). It would have been helpful to have available reliabilities by age or grade levels, but unfortunately such data are apparently unavailable in published form. Since the test was standardized for an age group inclusive of the present sample, the assumption is made that reliabilities are essentially comparable over the age range inclusive of Grades 8 and 11. In addition to the foregoing psychometric features, this scale was employed because the types of analysis planned had not been previously reported in the literature for the PMA.

On the day following the administration of the PMA, the Myers-Ruch High School Achievement Test (MRT) was administered. Eight students were absent for the test so that analyses including the MRT are based on an N of 92, 46 boys and 46 girls. This test was selected because of the short testing time (one hour) and the fact that it samples a variety of subject matter areas. It would probably have been more desirable to utilize any of the achievement tests yielding a variety of specific scores rather than an overall score, as does the MRT, but the time necessary to administer such tests was not available.


TABLE 1
MEANS AND STANDARD DEVIATIONS OF CHRONOLOGICAL AGE AND IQ
AT GRADE 8 AND GRADE 11

                     Grade 8                      Grade 11
Statistic       B        G        T          B        G        T
N               49       51       100        49       51       100
M_CA            12.84    12.72    12.78      16.28    16.17    16.23
σ_CA            .67      .43      .57        .62      .42      .55
M_IQ            [...]    108.20   107.95     106.24   111.65   108.51
σ_IQ            [?]7.78  14.41    16.56      15.81    14.30    15.05


¹ Appreciation is expressed to H. S. Konvolinsa, Superintendent of Schools, Duquesne, Pennsylvania, and [...] H. McKeegan, guidance counselor at the high school, for their cooperation in the conduct of this study.


RESULTS


Developmental Changes in the Magni- 
tude of g 


Garrett (1946), in his well known developmental theory of intelligence, [holds] that the relative dominance of [a general factor] g decreases with age, resulting in the emergence of specific factors. The inconsistent findings reported in the literature would seem to be the result of the methodological problems inherent in the cross-sectional approach (Anastasi, 1948). A study by Asch (1936), who retested in Grade 7 the subjects originally tested by Schiller (1934) in the third grade, deserves closer attention because of the longitudinal approach employed. Consistent with Garrett's position, lower correlations between the verbal and numerical tests were found after the 4 years. It should be noted, however, that only 161 of the original sample of 395 were available for the second testing. In addition to the effects of losing 59% of his original sample, the degree to which the tests were appropriate for the older subjects was questionable.

The present data were analyzed in a manner consistent with earlier studies so as to provide a basis for comparison. Several investigators (Asch, 1936; Garrett, Bryan, & Perl, 1935) have reported median r's derived from intercorrelations among the subtests at each age level. A similar analysis presented in Table 2 for the total group (N = 100) reveals a slight decrease (.02) in the magnitude of the intercorrelations. In a separate analysis by sex, also shown in Table 2, the median intercorrelations for boys were .31 and .28 and for the girls they were .32 and .27, indicating little or no sex difference. A second analysis, involving the determination of the proportion of variance accounted for in the first unrotated centroid factor, supports the foregoing conclusions (the percentages being 39% and 37%, respectively). Since both analyses are assumed to be related to the magnitude of g, it may be concluded that within the age range of the sample there is little or no evidence for increased differentiation of abilities. It should be noted, however, that several of the intercorrelations are moderately high, particularly those involving the V subtest, suggesting that the g factor is present.
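The two indices used in this paragraph can be sketched as follows. This is a minimal illustration, not the authors' computation: the 5 × 5 matrix is hypothetical (it is not the Table 2 data), the function names are invented for the example, and the diagonal cells are filled with the largest correlation in each column, which is one common communality estimate for the centroid method and is an assumption here.

```python
from statistics import median


def off_diagonal(r):
    """Return the correlations above the principal diagonal of a square matrix."""
    n = len(r)
    return [r[i][j] for i in range(n) for j in range(i + 1, n)]


def first_centroid_share(r):
    """Proportion of total variance carried by the first centroid factor.

    Assumption: each diagonal cell is replaced by the largest absolute
    correlation in its column before the column sums are taken.
    """
    n = len(r)
    full = [row[:] for row in r]
    for j in range(n):
        full[j][j] = max(abs(r[i][j]) for i in range(n) if i != j)
    column_sums = [sum(full[i][j] for i in range(n)) for j in range(n)]
    grand_total = sum(column_sums)
    # First centroid loading of a test = its column sum / sqrt(grand total).
    loadings = [s / grand_total ** 0.5 for s in column_sums]
    return sum(a * a for a in loadings) / n


# Hypothetical matrix in the order V, S, R, N, W (illustrative values only).
example = [
    [1.00, 0.40, 0.45, 0.50, 0.35],
    [0.40, 1.00, 0.30, 0.15, 0.10],
    [0.45, 0.30, 1.00, 0.40, 0.20],
    [0.50, 0.15, 0.40, 1.00, 0.25],
    [0.35, 0.10, 0.20, 0.25, 1.00],
]

median_r = median(off_diagonal(example))
share = first_centroid_share(example)
print("median intercorrelation:", round(median_r, 3))
print("first centroid factor share of variance:", round(share, 3))
```

Computing both indices once per grade level and comparing them gives the kind of Grade 8 versus Grade 11 contrast described above; a drop in either value would point toward differentiation.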

TABLE 2
INTERCORRELATIONS AMONG PRIMARY MENTAL ABILITIES SUBSCALES AT GRADE 8
AND AT GRADE 11

                                                         Mean Correlation
Group               V      S      R      N      W       Grade 8   Grade 11
Boys          V     ..     .38    .52    .56?   .47?     .31       .28
(N = 49)      S     .37    ..     .29    .17?   .06
              R     .43    .12    ..     .43    .03
              N     .53    .11    .56    ..     .19
              W     .26    .06?   .10    .17?   ..
Girls         V     ..     .41    .30    .49    .26      .32       .27
(N = 51)      S     .38    ..     .42    .16    .19
              R     .42    .31    ..     .30    .39
              N     .54    .07    .48    ..     .24
              W     .12    .29    .12    .04    ..
Total         V     ..     .37    .42    .53    .38      .31       .29
(N = 100)     S     .34    ..     .33    .14    .10
              R     .46    .18    ..     .38    .20
              N     .55    .06    .54    ..     .24
              W     .24    [?]    .15    .18?   ..

Note. Grade 8, above principal diagonal; Grade 11, below principal diagonal. Entries followed by ? are only partly legible in the source.


Previous research (Meyer, 1960; Thurstone, 1938; Thurstone & Thurstone, 1941; Tyler, 1958) along with the present data include a developmental span ranging from Grade 1 through senior high school and up to the age of 25, permitting tentative conclusions concerning age-grade changes in performance on the PMA. Tyler (1958) reports a longitudinal study in which the performance of the youngsters in the first grade is compared with their performance when in the fourth grade which in turn is compared with their eighth grade scores. From Grade 1 to Grade 4, it was found that total score predicted subsequent performance on each subtest about as well as prior performance on the specific subtest itself. The comparison from Grade 4 to Grade 8 revealed that for N the correlation between corresponding subscores was considerably higher than the correlation between Grade 4 T score and Grade 8 N score. Meyer (1960), who performed the same analysis on the PMA from Grade 8 to Grade 11, found that in addition to N the S subtest also emerges. It should be noted, however, that all of the corresponding subtest correlations were higher than the correlations between total score and each subsequent subscore. Thurstone (1938) working with eighth grade children reports a correlation between the six primary factors (which he called a second-order general factor), but in another study (Thurstone & Thurstone, 1941) in which the age range was from 16 to 25, no such second-order factor emerged. From these studies, then, the conclusion may be made that in the primary grades (1-4) the PMA abilities measure general ability whereas from Grade 4 to 8 numerical ability emerges from the others as a more discrete aptitude. After Grade 8, a spatial ability emerges, but not until the Age Group 16 to 25 is there evidence for independent factors.


Consistency of Relative Position and Profile Differences

An important characteristic of any test is the degree to which [present] performance predicts future [standing] on the particular variable. Certainly many things can happen to an individual which might effect a significant change in his performance relative [to the] group which would serve to minimize [the value] of prior information. Doppelt and Bennett (1951) examined the [long term] consistency of the Differential Aptitude Tests (DAT), reporting [...] high correlations over 2 [to] 3 years with very little difference between the sexes. Certain tests, [such as Verbal] Reasoning and Numerical Ability, showed more consistency [over] time than tests such as [Mechanical] Reasoning, Space Relations, [and Clerical] Ability, prompting the [suggestion] that such differences in [consistency] are related to a uniformity [of training] in the former abilities [whereas] the latter, being nonschool [abilities], are more subject to individual [interests] and experiences. The [present data] are relevant to this hypothesis. [Before] considering the data analysis [...] followed in this section, it will be necessary to examine the reliability of the subtests. Obviously if [the] reliabilities of the subtests [are] low, consistency measures [...] would be meaningless. The [following] split-half coefficients have been reported in the manual (Thurstone, 1958) for tenth graders: V, .92; [...]; and W, .72.² These reliability coefficients are sufficiently high so that any observed changes in relative position over the 3.5 [years] could reasonably be ascribed to factors other than errors of measurement.
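Table 3 reports retest coefficients "corrected for attenuation" next to the raw correlations. A minimal sketch of that correction is given below, assuming the classical Spearman formula and assuming that the manual reliability of a subtest applies at both testings; neither assumption is stated in the text, and the function name is hypothetical.

```python
from math import sqrt


def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Spearman's correction: estimated correlation between true scores."""
    return r_xy / sqrt(r_xx * r_yy)


# Retest r's from Table 3 and the split-half reliabilities quoted in the text
# (V = .92, W = .72), applied to both the Grade 8 and the Grade 11 testing.
v_total = correct_for_attenuation(0.81, 0.92, 0.92)  # Verbal Meaning, total group
w_boys = correct_for_attenuation(0.52, 0.72, 0.72)   # Word Fluency, boys

print(round(v_total, 2), round(w_boys, 2))  # prints 0.88 0.72
```

Both values agree with the parenthesized entries printed for those cells in Table 3, which suggests, without proving, that this is the correction that was applied.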

A summary of the analysis is presented in Table 3. These test-retest correlations are based on the same form of the PMA and, for the total group, are remarkably high except for W. The fact that performance on V, R, and N is more stable over time than on S and W is consistent with the findings of Doppelt and Bennett (1951) described above and would seem to support the notion that the academic abilities are more stable than the nonacademic. In contrast with previous findings, our data suggest the existence of sex differences in consistency. Statistical tests support this observation for V, S, and N (z = 2.08, 2.20, 2.17, respectively), but not for W and R (z = 1.45 and .52, respectively). These data can be interpreted as meaning that over the Grade Range 8-11 the relative position of boys as contrasted with girls is generally more stable.


TABLE 3
CORRELATIONS BETWEEN GRADE 8 AND GRADE 11 SCORES ON THE PRIMARY
MENTAL ABILITIES TEST

                     Boys          Girls         Total
Verbal Meaning       .88 (.96)     .74 (.80)     .81 (.88)
Space                .51 (.84)     .77 (.80)     .66 (.89)
Reasoning            .76 (.81)     .71 (.76)     .75 (.81)
Number               .83 (.93)     .63 (.71)     .73 (.82)
Word Fluency         .52 (.72)     .27 (.88)     .48 (.59)
N                    49            51            100

Note. Entries in parentheses are coefficients corrected for attenuation.


² Reliability for the Word Fluency subtest was determined by use of the separately-timed halves technique originally reported by Anastasi and Drake (1954).
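The sex comparison of retest correlations can be illustrated with Fisher's r-to-z transformation for two independent correlations. The text does not name the test that was used, so this choice is an assumption, as is the helper name `fisher_z_difference`; the correlations and group sizes are those printed in Table 3.

```python
from math import atanh, sqrt


def fisher_z_difference(r1, n1, r2, n2):
    """z statistic for the difference between two independent correlations."""
    standard_error = sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    return (atanh(r1) - atanh(r2)) / standard_error


BOYS_N, GIRLS_N = 49, 51

# (boys, girls) Grade 8 to Grade 11 retest correlations from Table 3.
retest_r = {
    "V": (0.88, 0.74),
    "S": (0.51, 0.77),
    "R": (0.76, 0.71),
    "N": (0.83, 0.63),
    "W": (0.52, 0.27),
}

z_values = {
    scale: abs(fisher_z_difference(r_boys, BOYS_N, r_girls, GIRLS_N))
    for scale, (r_boys, r_girls) in retest_r.items()
}

for scale, z in z_values.items():
    # 1.96 is the two-tailed .05 criterion for a normal deviate.
    print(scale, round(z, 2), "significant" if z > 1.96 else "not significant")
```

Run on the Table 3 entries, this procedure gives values within a few hundredths of the z's reported in the text and the same pattern of significance (V, S, and N beyond the .05 criterion; W and R not). Note that for S the larger coefficient belongs to the girls.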
Another problem of particular importance when dealing with specific traits as on the PMA is that of profile stability. Guidance counselors in particular are concerned with the long term stability of difference scores since they are often used to determine a person's future program of study. This [problem has] been examined by [Doppelt and Bennett (1951)] for the DAT, who report a [median r] of .50 for all possible combinations of differences between test scores with a range from .20 (Numerical Ability minus Space Relations) to .74 (Mechanical Reasoning minus Spelling). Little in the way of sex differences was noted.


TABLE 4
RETEST CORRELATIONS BETWEEN PRIMARY MENTAL ABILITIES SUBTEST DIFFERENCE
SCORES AT GRADES 8 AND 11

PMA Subtests     Boys     Girls     Total
V - S            .30      -.12      .14
V - R            .65      [.?9]     .42
V - N            .68      .10       .39
V - W            .56      -.02      .27
S - R            .53      .07       .37
S - N            .66      -.10      .31
S - W            .44      .06       .33
R - N            .76      -.19      .25
R - W            .50      -.03      .29
N - W            .36      -.02      .15


The correlations presented in Table 4 were computed by taking all the possible nonredundant differences between subtests at each grade level and computing product-moment r's for corresponding pairs. The median r for the total group is .35 with a range from .14 (V minus S) to .42 (V minus R). These data are certainly not encouraging support for the use of profile differences. Further examination of Table 4 shows a higher median r for boys (.55) than for girls (.08) which was to be expected in the light of our previous findings concerning consistency of performance. A tentative explanation for these sex differences is that the academic program pursued by the boys is more in conformity with the abilities measured on the PMA at both grade levels whereas for the girls the commercial courses have the effect of changing their relative strengths and weaknesses in Grade 11 in contrast to what they were in Grade 8.

The generally lower correlation of difference scores noted for the PMA in contrast with the DAT requires further examination. The most likely explanation for these divergent results lies in the different statistical techniques used for analysis. Doppelt and Bennett (1951) computed their measures of relationship by means of a formula³ which assumes equal variability among the variables, an assumption for which they do not offer supporting evidence. Judging from the data presented in the manual for the DAT, however, it would appear that the variance in Grade 12 would have been considerably larger than in Grade 9. If the assumption were seriously violated, however, their approach would yield an overestimate of the degree of relationship. Since our data did not fulfill the assumption of homogeneity of variance, Pearson product-moment correlations were used. The median magnitude of relationship for the total group, using the Doppelt-Bennett approach, would have been .55, ranging from .46 (R minus W) to .68 (S minus N), which represents a substantial improvement over the results derived from product-moment r's.
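The contrast between the two ways of correlating difference scores can be sketched numerically. The block below assumes that the equal-variability formula is the standard identity for the correlation between two differences of unit-variance scores, computed from the six intercorrelations alone; the data are synthetic, and all function and variable names are hypothetical.

```python
import random
from math import sqrt


def pearson(x, y):
    """Product-moment correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def standardize(x):
    """Convert scores to z scores (mean 0, standard deviation 1)."""
    n = len(x)
    m = sum(x) / n
    sd = sqrt(sum((a - m) ** 2 for a in x) / n)
    return [(a - m) / sd for a in x]


def difference(x, y):
    return [a - b for a, b in zip(x, y)]


def equal_variance_difference_r(a1, b1, a2, b2):
    """Correlation of (a1 - b1) with (a2 - b2), treating all variances as equal."""
    numerator = pearson(a1, a2) + pearson(b1, b2) - pearson(a1, b2) - pearson(b1, a2)
    denominator = 2 * sqrt((1 - pearson(a1, b1)) * (1 - pearson(a2, b2)))
    return numerator / denominator


# Synthetic scores: subtests a and b at a first testing (a1, b1) and a
# retest (a2, b2); the retest of b is given a much larger spread.
random.seed(0)
ability = [random.gauss(0, 1) for _ in range(100)]
a1 = [g + random.gauss(0, 1) for g in ability]
b1 = [g + random.gauss(0, 1) for g in ability]
a2 = [x + random.gauss(0, 0.5) for x in a1]
b2 = [3.0 * x + random.gauss(0, 1.5) for x in b1]

formula_r = equal_variance_difference_r(a1, b1, a2, b2)
standardized_r = pearson(
    difference(standardize(a1), standardize(b1)),
    difference(standardize(a2), standardize(b2)),
)
raw_r = pearson(difference(a1, b1), difference(a2, b2))

print("equal-variance formula:      ", round(formula_r, 3))
print("r of z-score differences:    ", round(standardized_r, 3))
print("r of raw-score differences:  ", round(raw_r, 3))
```

The formula value coincides with the correlation of differences between standardized scores, while the product-moment r computed on raw differences departs from it whenever the variances are unequal. That is the sense in which the two techniques can give divergent results on the same data.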


Consistency of Factorial Structure 


In the analysis presented in this section our primary concern is with the degree to which the factor loadings on each of the PMA subtests change over time, i.e., factor stability. This problem is of importance not only for the theoretical issues involved but also in terms of more practical considerations. To our knowledge empirical data have not previously been reported on this [problem] for the PMA.

The scores of the 92 subjects who were present for both administrations of the PMA and the MRT were intercorrelated for boys and girls separately and for the total group. Each of the 11 × 11 matrices was factor analyzed by the complete centroid method,⁴ six factors being extracted. [The fifth] factor showed, in each matrix, one or more loadings greater than .30, while none of the loadings on the sixth factor reached .20. [Accordingly] only the first five factors were retained in the subsequent rotations. The centroid factor loadings were rotated to oblique simple structure using the oblimax analytic rotation criterion developed by Pinzka and Saunders (1954) and the correlations (direction cosines) among the oblique reference vectors were converted into factor intercorrelations by the procedure detailed by Cattell (1952, pp. 224-232). Two centroid second-order factors were extracted from the factor correlation matrices and rotated to oblique simple structure using the same procedures.


TABLE 5
FIRST- AND SECOND-ORDER REFERENCE VECTOR MATRICES AND INTERCORRELATIONS
AMONG FIRST- AND SECOND-ORDER PRIMARY MENTAL ABILITIES FACTORS

                                   Total (N = 92)                  Boys (N = 46)
Grade Level   Scale         V     S     R     N     W    [h²]      V
Grade 8       PMA-V         62   -01   -04   -01    09    39       54
              PMA-S        -02    67    05    05   -02    45      -05
              PMA-R         05   -01    63   -03    00    40       18
              PMA-N         11   -06   -03    58   -02    35       11
              PMA-W         11   -07    04   -05    58    36       09
Grade 11      PMA-V         51    03   -04    16    02    20?      62
              PMA-S         05    68   -06   -05    03    47       11
              PMA-R        -07    01    64    10    04    43      -17
              PMA-N        -07    04    06    63   -01    41      -03
              PMA-W        -05    07    01    01    61    38      -01
              HS Ach.       47    02    12   -13   -16    28       43

Correlations among factors (Total)
              V                   46    58    60    38
              S                         36    10    18
              R                               56    29
              N                                     32

Second-order vector loadings
                            V     S     R     N     W
Total         A             29    51    02   -22    20
              B             13   -27    36    57    05
Boys          A             39    48   -01   -03    37
              B             35   -09    61    60   -33
Girls         A             10   [...]
              B             52   -05    32    60   -21

Second-order factor intercorrelation:  Total 87   Boys 66   Girls 65

Note. Decimal points omitted. The remaining first-order columns for Boys and Girls are not legible in the source.


³ $r_{(1-2)(I-II)} = \dfrac{r_{1I} + r_{2II} - r_{1II} - r_{2I}}{2\sqrt{1 - r_{12}}\,\sqrt{1 - r_{I\,II}}}$, where (1-2) and (I-II) represent the difference obtained at first and second testings, respectively.

⁴ The writers are indebted to Gary Lotto and [...] Kehl for providing the facilities of the University of Pittsburgh Computation and Data Processing Center for [...] of the statistical analyses. A table giving the unrotated centroid factor loadings and transformation matrices has been deposited with the American Documentation Institute. Order Document No. 6544 from ADI Auxiliary Publications Project, Photoduplication Service, Library of Congress, Washington 25, D. C., remitting in advance $1.25 for microfilm or $1.25 for photocopies. Make checks payable to: Chief, Photoduplication Service, Library of Congress.

Table 5 gives the oblique vector loadings of each test, the correlations among the primary factors, the vector loadings of the five primary factors on two second-order factors for each sex group and also for the combined groups. The primary factors within each analysis are clearly Thurstone's V, S, R, N, and W factors, but the factor intercorrelations are quite large in several cases, indicating that the PMA factors cannot be considered as orthogonal or independent. The presence of a single factor at the second-order level might be considered as evidence of a general g factor unifying the five primary factors, but it appears that two factors must be postulated at the second-order level to account adequately for the factor intercorrelations at the first-order level. The large correlations between the second-order factors suggest a general g factor is present but at the third-order rather than at the second-order level.

Certain interesting sex differences in 
the factor structure of the PMA scales 
are apparent in the separate sex group 
analysis that are obscured when the 
groups are combined. The V scale ap- 
pears to be less well defined at the 
eleventh grade level for the girls (load- 
ing of .36) although its loading is quite 
high at the eighth grade level (.69) and 
at both grade levels for the boys (.54 
and .62). The V scale also has a considerable
loading on the N factor (.37)
at the eleventh grade level for girls although
none of the other scales similarly
load on two primary factors in
either matrix. Second-order Factor A 
has loadings on both the S and W pri- 
mary factors and Factor B on the V, R, 
and N primary factors for both sex 
groups. However, Factor V divides its 
variance between Factors A and B for 
the boys while V is loaded only on B 
for the girls. In spite of these apparent 
sex differences the intercorrelations 
between Factors A and B are quite 
similar in both sex groups (.66 and .65), 
but this correlation is much larger when 
all subjects are combined into the total 
group (.87). 

Considering first the results for the 
total group it seems quite clear that a 
high degree of consistency among the 
PMA factors prevails from Grade 8 
through Grade 11. It can be concluded 
from these data that the structure of 
intelligence changes very little over the 
time period included in this study. The 
large correlation between the factors 
suggests that the structure of intelli- 
gence is not composed of independent 


traits but rather that these traits share 
a common source of variation. The 
foregoing interpretation is consistent 
with Vernon (1950) who proposes a
hierarchical structure of intelligence
with Spearman’s (1927) g as the major 
integrating factor. 

Considering next the data for the 
boys and girls, it can be concluded that 
factorial stability is greater for boys 
than for girls. That factorial structure 
changes for the girls as reflected in the 
lower factor loading on V in the elev- 
enth grade as compared with its load- 
ing in the eighth grade is not surprising 
in view of the previously reported 
findings. A tentative explanation of
these findings will be discussed in a
later section of this paper.


Developmental Changes in Magnitude 
of Sex Differences and Rates of 
Growth of Each of the Primary 
Abilities 

Herzberg and Lepkin (1954) report 
comparisons between boys and girls at 
cach of three age levels (16, 17 18) oD 
the subtests of the PMA. At each age 
level the [...] subtest was significantly
higher for the boys. Scores on the subtests
V, R, and W were significantly higher
for the girls at age 17 but not at
[...], and at age 18 [...]
reliably higher for the girls [...]
thus confounding any conclusions concerning
sex differences at various levels
of development. The present data are
not hampered by this limitation.

Sex differences on each of the subtests
were determined for each grade
[...] favor of
the girls. Though scores on the space
A LONGITUDINAL STUDY OF MENTAL ABILITIES 


TABLE 6

SEX DIFFERENCES ON PRIMARY MENTAL
ABILITIES SUBTESTS AT GRADES 8 AND 11

PMA        Grade 8 Means           Grade 11 Means
Subtest    Boys  | Girls |  t    | Boys  | Girls |  t
V          16.83 | 17.80 |  .75  | 26.65 | 30.63 | 1.99*
S          20.87 | 18.59 | 1.18  | 30.69 | 28.50 | 1.06
R          12.24 | 14.19 | 1.63  | 17.02 | 19.72 | 2.29*
N          15.19 | 17.73 | 1.63  | 19.61 | 24.28 | 2.52*
W          29.28 | 33.00 | 1.73  | 36.78 | 42.06 | 2.29*

* p < .05.


factor favor the boys at both grades,
the differences are not statistically
significant. To the degree that our
subjects are comparable to
the 16 [...] of Herzberg and Lepkin,
the correspondence of findings
would support the conclusion that eleventh
grade girls achieve higher scores
on V, R, N, and W than boys.


Long Term Prediction of Achievement


This section is concerned with the
ability of each of the subtests to
predict achievement 3.5 years later.
Correlations are presented in Table 7
based on Grade 8 subtest scores and
Grade 11 achievement as well as Grade
11 subtest scores and Grade 11 achievement.
It will be recalled that the measure
of achievement is performance on
the MRT. The magnitude of these r's
[...], particularly the V
[...] for boys. Generally these data
are in agreement with other studies
(Shinn, 1956; Thurstone, 1958; Wellman,
1957), particularly with respect
to the overall superiority of V in predicting
achievement. The sex differences
in predictive power agree with
Shinn (1956) who found small but
consistent differences in the magnitude
of r in favor of boys. That our differences
are larger is probably attributable
to the greater proportion of our
girls being enrolled in the commercial




TABLE 7

CORRELATIONS OF PRIMARY MENTAL ABILITIES
SUBTESTS ADMINISTERED AT GRADES 8 AND 11
WITH RUCH HIGH SCHOOL ACHIEVEMENT TEST
ADMINISTERED AT GRADE 11

PMA        Boys                 Girls                Total
Subtest    Grade 8 | Grade 11 | Grade 8 | Grade 11 | Grade 8 | Grade 11
V            .70   |   .74    |   .56   |   .45    |   .58   |   .52
S            .33   |   .26    |   .23   |   .29    |   .30   |   .28
R            .5[?] |   .49    |   .36   |   .34    |   .36   |   .32
N            .54   |   .46    |   .20   |   .12    |   .31   |   .20
W            .24   |   .22    |   .17   |  -.05    |   .14   |   .05


program wherein preparation for a test 
such as the MRT is less effective than 
a more academically oriented course of 
study. 

Since it is ordinarily expected that 
concurrent validity is higher than long 
term validity, the lower correlations 
between eleventh grade achievement 
and eleventh grade PMA scores than 
between eighth grade PMA scores and 
eleventh grade achievement require
some consideration. After discarding
several psychological hypotheses as
being illogical or untestable, the writers 
next examined the data for possible 


statistical explanations. One possibility
considered was the difference in
variances from Grade 8 to Grade 11 on
the PMA subtests but, it will be recalled,
the Grade 11 variances are
greater, which should have led to larger
correlations. Still attending to the
difference in variances, a testable
statistical hypothesis was developed.
Considering the definition of r as the
ratio of the covariance to the product
of the standard deviations of the two
variables⁶ it can be shown that if the


⁶ This formula symbolically is

$$ r = \frac{\sum xy / N}{\sigma_x \sigma_y} $$

from which it can be shown that the ratio
of any pair of correlations is proportional
to the ratio of the standard deviations.




covariance remains constant and the σ
of one of the two variables remains
constant then any increase in the size
of the second σ must reduce the magnitude
of the resulting correlation coefficient.
In the present problem the σ
remains constant for the achievement
test variable but not for the PMA subtest
variable which is larger for the
eleventh grade than the eighth. The
crucial question remaining is whether 
or not the covariation between Grade 8 
subtest scores and Grade 11 achieve- 
ment scores is similar to the covariation 
between Grade 11 PMA scores and 
Grade 11 achievement scores. Direct 
computation of the essential covari- 
ances indicated that in every case they 
were equal or approximately equal. 
These latter findings then permit the 
conclusion that the observed differ- 
ences in predictive validity are merely 
statistical artifacts. Nevertheless it 
should be noted that the greater vari- 
ability in the eleventh grade PMA 
scores does not improve their predic- 
tive power. We believe this occurs 
because the harder items at the older 
age level are less appropriate discrimi- 
nators for a test such as the MRT. Cer- 
tainly further research is necessary, 
using a variety of achievement meas- 
ures, before these assertions will have 
any generality. 
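
The statistical argument above can be illustrated with a minimal sketch. The numbers are hypothetical and are not taken from the study; the function name `pearson_r` is likewise an assumption made for illustration. With the covariance and one standard deviation held constant, a larger second standard deviation necessarily yields a smaller r.

```python
def pearson_r(cov_xy, sd_x, sd_y):
    """r defined as the covariance divided by the product of the two SDs."""
    return cov_xy / (sd_x * sd_y)

# Hypothetical values chosen only to show the arithmetic.
cov = 12.0              # covariance, assumed equal at both grades
sd_achievement = 6.0    # SD of the achievement test, held constant
sd_pma_grade8 = 4.0     # smaller PMA subtest SD (assumed)
sd_pma_grade11 = 5.0    # larger PMA subtest SD (assumed)

r_grade8 = pearson_r(cov, sd_pma_grade8, sd_achievement)
r_grade11 = pearson_r(cov, sd_pma_grade11, sd_achievement)

# The ratio of the two correlations equals the inverse ratio of the SDs.
ratio = r_grade8 / r_grade11
print(r_grade8, r_grade11, ratio)
```

Under these assumed values the Grade 8 coefficient exceeds the Grade 11 coefficient solely because the Grade 11 standard deviation is larger.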


Discussion 


Of all the conclusions warranted by 
this study the most surprising are those 
involving sex differences. The fact that 
scores on the subtests V, R, and W 
were significantly better for the girls is 
consistent with other data, but that 
these girls perform reliably better than 
the boys on N is clearly inconsistent 
with other studies (Herzberg & Lepkin, 
1954). In addition the consistently 
higher correlations found for the boys 
in contrast to the girls are unique in the
literature. This latter finding is prob- 


WILLIAM J. MEYER AND A. W. BENDIG 


ably best explained statistically in that 
the performance of the boys is more 
variable in Grade 11 than that of the 
girls and this, combined with the higher 
mean performance of the eleventh 
grade girls, suggests the presence of a 
ceiling effect for the girls. As a basis for 
a tentative explanation of the observed 
sex differences, we postulate that the 
training received by the girls in the 
commercial program leads to changes 
in the organization of intellectual 
abilities which do not result from the 
academic program of study followed by 
the boys. Recall that for the eleventh 
grade girls the V factor splits its variance
with the N factor indicating that
these factors overlap. We postulate
that this overlap represents the emergence
of a clerical ability which reflects
an emphasis on training for accuracy
and speed that is not necessarily
stressed to the same degree in the
academic program. But it is exactly
these abilities which are important to
good performance on the N subtest and
to a lesser degree on the V subtest. If
our analysis is correct, then the superior
performance of the girls on the N
subtest reflects their greater training
on this type of task. This explanation
is consistent with the [...] cultural
interpretation usually given similar [...].


The writers recognize that the foregoing
assertions [...]
but in the absence of [...]
attempted to
put forth an idea that might be tested
in a subsequent study. It may be that
other interpretations will [...] more
fruitful but in any event, with the large
numbers of girls enrolled in commercial
courses, it would seem important that
some attempts be made to replicate
our findings.




[...] Certainly the test-retest correlations
for the Factors V, R, and N
are [...] high to permit cautious
[...] counseling. This is fortunate
since those factors correlate most
highly with achievement. The Factors
S and W are quite unstable and appear
to contribute little towards the prediction
of future behavior. It should be
noted that despite the claims
[...] the V subtest is still the best
predictor of achievement, is the most
stable over time, and, importantly,
correlates highest with the other subtests
and the total score both concurrently
([...]) and long term (.64). The
[...] predictor of achievement is
[...] score (.47) which is also the most
[...] (.82). Though long term
consistency in relative position is satisfactory,
the same cannot be said for
[...] of differences between
[...]. Profiles as derived from the
PMA [...] appear to have little value
in differential prediction or for counseling
[...]. On the basis of the
[...] reporting sex differences
on the PMA subtests it seems quite
evident that separate norms for boys and
girls are needed if the test is to be used
[...]. Finally we should like to
call for further research on the reasons
for the [...] sex differences and for
[...] validation studies employing
[...] other than those pursuing
[...] programs in the high schools
and colleges.


SUMMARY


A sample of 100 eighth graders was
administered the Primary Mental
Abilities Test, Intermediate Form, and
Was readministered the same test as 
they were concluding the eleventh 
grade. Several analyses of the resulting 
data were performed which showed 
that relative position on the PMA sub- 
tests is fairly well maintained over 
time but that differences between sub- 




test scores are quite unstable. Though 
no sex differences were noted at Grade 
8, there were significant differences in 
favor of the girls at Grade 11 on the V, 
R, N, and W subtests. In addition to 
these sex differences it was noted that 
girls were less consistent in perform- 
ance than the boys and their scores 
correlated less well with achievement, 
as measured by the Myers-Ruch High 
School Achievement Test. The Verbal 
Meaning subtest predicted achieve- 
ment most effectively, with the Total 
score ranking second. Analyses were 
also made relevant to the problem of 
the structure of intelligence and the 
degree to which specific factors emerge. 
The absence of evidence for increased 
differentiation of abilities as noted 


from Grade 8 to Grade 11 and the 


highly correlated second-order factor 


was viewed as support for Vernon’s 
hierarchical-structure theory of intel- 


ligence. 


REFERENCES 


Anastasi, Anne. The nature of psychological
"traits." Psychol. Rev., 1948, 55, 127-138.

Anastasi, Anne, & Drake, J. D. An empirical
comparison of certain techniques for
estimating the reliability of speeded
tests. Educ. psychol. Measmt., 1954, 14,
529-540.

Asch, S. A study of change in mental organization.
Arch. Psychol., 1936, No. 195.

Cattell, R. B. Factor analysis. New York:
Harper, 1952.

Doppelt, J. E., & Bennett, G. D. A longitudinal
study of the differential aptitude
test. Educ. psychol. Measmt., 1951,
11, 228-237.

Garrett, H. E. A developmental theory of
intelligence. Amer. Psychologist, 1946,
1, 372-378.

Garrett, H. E., Bryan, Alice I., & Perl,
R. The age factor in mental organization.
Arch. Psychol., 1935, No. 176.

Herzberg, F., & Lepkin, M. A study of sex
differences on the Primary Mental Abilities
Test. Educ. psychol. Measmt., 1954,
14, 687-689.

Meyer, W. J. The stability of patterns of
primary mental abilities among junior
high and senior high school students.
Educ. psychol. Measmt., 1960, 20, 795-800.

Pinzka, C., & Saunders, D. R. Analytic
rotation to simple structure: II. Extension
to oblique solutions. Princeton,
N.J.: Educational Testing Service, 1954.

Schiller, B. Verbal, numerical, and spatial
abilities of young children. Arch. Psychol.,
1934, No. 161.

Shinn, E. O. Interest and intelligence as
related to achievement in tenth grade.
Calif. J. educ. Res., 1956, 7, 217-220.

Spearman, C. The abilities of man. New
York: Macmillan, 1927.

Thurstone, L. L. Primary mental abilities.
Psychometr. Monogr., 1938, No. 1.

Thurstone, L. L., & Thurstone, T. G.
Factorial studies of intelligence.
Psychometr. Monogr., 1941, No. 2.

Thurstone, Thelma G. Manual for the SRA
Primary Mental Abilities. Chicago:
Science Research Associates, 1958.

Tyler, Leona. The stability of patterns of
primary mental abilities among grade
school children. Educ. psychol. Measmt.,
1958, 18, 769-774.

Vernon, P. The structure of human abilities.
New York: Wiley, 1950.

Wellman, F. E. Differential prediction of
high school achievement using single
score and multiple factor tests of mental
maturity. Personnel guid. J., 1957, 35,
512-517.


(Received August 22, 1960) 


JOURNAL OF 
EDUCATIONAL PSYCHOLOGY 


Volume 52 


April 1961 


Number 2 


SEX ROLE AND COMMUNITY VARIABILITY 
IN TEST PERFORMANCES¹


CARSON McGUIRE²
University of Texas 


In the Human Talent Project, which 
has dealt so far with boys and girls in 
their junior high school years, one ma- 
jor problem has been the identification 
of suitable measures which combine 
effectively to explain and forecast tal- 
ented behavior among adolescents 
(Hindsman & Duke, 1960). The proposed
psychological model for research
in human talent (McGuire, 1960) posits
three categories of variables.
Each kind of behavior to be explained
or predicted is, in large part, a function
of (a) potentialities of an individual
pertinent to that behavior, (b)
expectations regarding supportive or
nonsupportive responses of self and
others, and (c) pressures imposed upon
the person by parents, age-mates, and
teachers. The pressures form a part of
what Goethals (1958) terms the “context”
in his framework for educational
research. Not only the interpersonal
but also multipersonal situations or
institutional settings have to be taken
into account.

¹ The research reported herein was supported
through the Cooperative Research
Program of the Office of Education, United
States Department of Health, Education,
and Welfare.

² The author is indebted to Harl Jennings
and F. J. King (now at Florida State University),
research associates of the Human
Talent Project, and to Kathleen Silva, statistical
clerk, Laboratory of Human Behavior,
for valuable assistance in carrying out
the numerous computations for this research.




This paper reports a series of tests 
of the proposition that sex role and 
school location have a moderating in- 
fluence upon the performances of jun- 
ior high school students on cognitive 
and noncognitive instruments. Each 
of the test variables has been selected 
as a possible measure of the potentialities,
expectations, or pressures. If
the results support the proposition, the
model should be modified by two
kinds of moderator variables. One is
variability of certain test variables in
terms of sex-role identification and
the sex-typing of socialization pres-
sures upon boys and girls (More, 
1953). The other represents differences 
in community context and patterns of 
educational experiences from one loca- 
tion to another with accompanying 
variations in what is expected and 
what may be learned (Ferguson, 
1954). Indirectly, the series of studies 
permits an evaluation of the useful- 
ness of the context variables suggested 
by Goethals (1958). More directly, 
the results provide some necessary 
data about relations among variables 
which may be used to define the dimensions
of and to predict talented
behavior.


METHOD




ences, however, provided data upon the 
complex interplay among variables en- 
countered when subjects of both sexes were 
drawn from schools in several communities 
and when they represented different family 
backgrounds and levels of mental function. 
Therefore three different tests of the prop- 
osition were designed, each using a multivariate
approach made possible by programing
analyses for a modern high speed
computer.


Analyses of Variance 


The first design made provisions for two 
other sources of variation in test perform- 
ance in addition to sex role and school con- 
text. One of these components was family 
status, representing variations in culturally- 
typed experiences and the socialization pres- 
sures of different family backgrounds in 
each community (Auld, 1952; McGuire, 
1953). The other was level of mental function,
to test the interaction of cognitive
functioning with the other three components
of variance. Sex role (S), family status (F),
mental function (M), and school location
(L) were combined in a 2 × 3 × 3 × 4
factorial design with two replications in each
cell. The replications provided an error
term with 72 degrees of freedom for testing
main effects and interactions among the
postulated sources of variation in performances
upon 22 cognitive and 22 noncognitive
test variables.
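
The bookkeeping of this design can be checked with a short sketch. The variable and function names below are illustrative assumptions; only the factor levels, the two replications, and the 72 error degrees of freedom come from the text.

```python
from itertools import product

# Levels of each factor as described in the text:
# sex role (S), family status (F), mental function (M), school location (L).
levels = {"S": 2, "F": 3, "M": 3, "L": 4}
replications = 2  # subjects per cell

# Enumerate every cell of the 2 x 3 x 3 x 4 factorial layout.
cells = list(product(*(range(n) for n in levels.values())))
n_cells = len(cells)
n_subjects = n_cells * replications
error_df = n_cells * (replications - 1)  # within-cell degrees of freedom

def effect_df(effect):
    """Degrees of freedom for a main effect or interaction, e.g. 'S' or 'FML'."""
    df = 1
    for factor in effect:
        df *= levels[factor] - 1
    return df

def subjects_per_group(effect):
    """Number of subjects sharing one combination of the named factors."""
    groups = 1
    for factor in effect:
        groups *= levels[factor]
    return n_subjects // groups

print(n_cells, n_subjects, error_df)
print({e: effect_df(e) for e in ["S", "F", "M", "L", "SFML"]})
```

The sketch reproduces the 144 subjects and the 72 error degrees of freedom stated above, and `subjects_per_group` gives the group sizes on which an intraclass coefficient for any effect would rest.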

Subjects. The subjects were 144 junior
high school students, 72 boys and 72 girls,
drawn by random procedures to fit the research
design from a total population of
1,417 who had responded to each of the
tests in the seventh grade. Each of the
four communities, designated A, B, C, and
D, participating in the Human Talent Project
was represented by 36 subjects. Among boys
and girls, equal numbers were selected
from high (HFS), middle (MFS), and low
(LFS) status homes as determined by an
index of social status for each family
(McGuire & White, 1955). Each subsample
was subdivided for level of mental function,
the measure of intelligence being the California
Test of Mental Maturity (CTMM),
Junior High Level, Form S, 1957. Based
upon standard deviations from the mean
for the total population of Anglo-, [...],
and Latin-American students, the subdivisions
were IQ 113 and above (HMF), 84 to
112 (AMF), and 83 or less (LMF).

Cognitive variables. Among the cognitive
test variables, 11 were measures of
achievement during the seventh grade.
Reading, language, and arithmetic achievement
were assessed by the California
Achievement Tests (CAT), Junior High
Level, Form W, 1957. CAT Reading had
two subtests, Vocabulary and Comprehension;
CAT Language had subscores for
Mechanics of English and for Spelling; and
CAT Arithmetic combined Fundamentals
and Reasoning. The remaining achievement
instruments were STEP Social Studies and
STEP Science from the Sequential Tests of
Educational Progress, Cooperative Test
Service, Form 3A, 1957.

The other 11 cognitive test variables
measured the cognitive, perceptual, and
psychomotor potentialities of each boy and
girl in the seventh grade year. STEP Listening,
which required comprehension of passages
read aloud, measured efficiency in
apprehension of verbal stimuli. Clerical Aptitude
and Mechanical Reasoning were
chosen from the well-known battery of
Differential Aptitude Tests (DAT), Form
A, 1947. Vocabulary Completion and Gestalt
Transformation, the latter to estimate ability
to shift the function of a part of an
object and use it in a new way, were taken
from a battery of factor tests used by Guilford,
Wilson, Christenson, and Lewis (1951).
Mutilated Words, Gestalt Completion,
Short Words, and Copying were selected
from a kit of reference tests, supplied by
the Educational Testing Service, to evaluate
various aspects of perceptual closure. Dotting
and DRT, to measure psychomotor
speed and discrimination reaction time, were
[...] permission of the [...]
Command, [...].


Noncognitive variables. The 22 noncognitive
instruments were designed to assess
elements of [...] variables [...]
Moore and Holtzman (195[...]) [...] a statewide
survey, [...] provided scales on
family tensions, negative [...] orientation,
authoritarian discipline, personal maladjustment,
criticism of education, criticism of
youth, social inadequacy, [...] inadequacy.
The children's form of the Manifest
Anxiety Scale (CMAS), as [...] by Castaneda,
McCandless, and Palermo (1956),
was modified for administration to adolescents
([...] reliability .90 for 41 items).
[...] the Brown-Holtzman Survey of
Study Habits and Attitudes (SSHA) was
[...] for use in secondary schools (odd-even
reliability .95 on 55 items keyed for
[...] girls). Thus modified, the SSHA
yielded two test variables, Scholastic Motivation
and Teacher Valuation.
Treatment of data. Distributions of scores
for the 44 test variables were in the form of
stanine values 1 to 9 with 5 as the mean
[...] the total population of 1,417 students.
The transformation of raw scores
into stanine values not only permitted
[...] use of the IBM 650 facilities at
the University Computing Center but also
[...] and homogeneity of variance.
[...] for the sample population
hovered about a stanine value of 5 with
somewhat larger deviations than the total
population since the subjects represented
extremes of family status and mental function
as well as boys and girls in the middle
[...] from each community.
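
As an illustration of the stanine transformation mentioned above, the following is a minimal sketch and not the procedure actually programmed for the IBM 650, which the text does not describe. It assumes the conventional stanine bands (4, 7, 12, 17, 20, 17, 12, 7, and 4 percent of cases) and a mid-rank percentile; the function names are hypothetical.

```python
from bisect import bisect_right

# Cumulative percentage cut points for the conventional stanine bands
# (4, 7, 12, 17, 20, 17, 12, 7, 4 percent of cases).
CUTS = [4, 11, 23, 40, 60, 77, 89, 96]

def stanine_from_percentile(pct):
    """Map a percentile rank (0-100) to a stanine value 1-9."""
    return bisect_right(CUTS, pct) + 1

def stanines(raw_scores):
    """Convert raw scores to stanines using mid-rank percentile ranks."""
    n = len(raw_scores)
    result = []
    for x in raw_scores:
        below = sum(1 for y in raw_scores if y < x)
        equal = sum(1 for y in raw_scores if y == x)
        pct = 100.0 * (below + 0.5 * equal) / n
        result.append(stanine_from_percentile(pct))
    return result

# Hypothetical raw scores: 100 distinct values.
values = stanines(list(range(100)))
mean_stanine = sum(values) / len(values)
print(mean_stanine)
```

For a full population the resulting values center on 5, which is why a sample drawn to include extremes of family status and mental function would show larger deviations about that value.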
The [...] analyses of variance were carried
out on the IBM 650 computer and F tests
were [...] to identify significant main effects
and interactions. Comparisons were
made with the deviation term (SFMLP),
which had 72 degrees of freedom, unless a
lower order interaction was significant at
the .01 or .05 level of confidence. Then intraclass
correlation coefficients were computed
for each significant mean square in
the resulting tables, employing the formula


$$ r_i = \frac{MS_a - MS_w}{MS_a + (n - 1)\,MS_w} $$

where:
MS_a = mean square for “among groups”
MS_w = mean square for “within groups”
n = number of subjects in each group


and using the deviation term (SFMLP) as
MS_w for each computation. The intraclass
correlation coefficients were regarded as
measures of the degree of “resemblance”
among subjects in the same sex role (S),
family status (F), level of mental function
(M), school location (L), or sharing some
combination of these attributes.
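
The intraclass formula can be sketched in a few lines. The mean squares used below are hypothetical values chosen for illustration and are not taken from the original analyses; the function names are likewise assumptions.

```python
def intraclass_r(ms_among, ms_within, n):
    """Intraclass correlation from among- and within-group mean squares.

    n is the number of subjects in each group.
    """
    return (ms_among - ms_within) / (ms_among + (n - 1) * ms_within)

def intraclass_r_from_f(f_ratio, n):
    """Same coefficient expressed through F = MS_among / MS_within."""
    return (f_ratio - 1.0) / (f_ratio + n - 1.0)

# Hypothetical mean squares for a main effect with 48 subjects per group.
r_example = intraclass_r(ms_among=47.0, ms_within=1.0, n=48)
print(round(r_example, 2))
```

Dividing numerator and denominator by the within-groups mean square gives the F-ratio form, so a coefficient near zero corresponds to an F ratio near 1.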


Factor Analyses 

The second approach to testing the
proposition that sex role and school location
influence performances on cognitive
and noncognitive tests involved a series of
factor analytic studies to map out relationships
among variables. By this time, the
remaining factor tests from the Guilford
battery had been scored for the total population
of 1,417 males and females in the
four school locations. Thus stanine values
for Unusual Uses, Consequences, Common
Situations, and Seeing Problems were added
to the master deck along with subject
grades assigned by teachers for the year
and grade point averages (GPA).

In each analysis, correlations among the 
variables provided a matrix for the extrac- 
tion of centroid factors which were rotated 
to an orthogonal normal varimax solution 
(Kaiser, 1958) by appropriate program- 
ing. The varimax criterion for analytic 
rotation was elected since it had the great- 
est likelihood of portraying factors invari- 
ant under changing samples of tests and 
populations. For this study, analyses were 
carried out for each sex role (m, f) and for 
each community (A, B, C, D) in addition
to the one for the total population.
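
To make the varimax criterion concrete, the sketch below rotates a two-factor loading matrix and keeps the angle that maximizes the variance of the squared loadings within each factor. It is a simplified illustration under stated assumptions: the loadings are hypothetical, the search is a plain grid over angles rather than Kaiser's analytic solution, and the "raw" criterion is used without the normalization by communalities that the normal varimax applies.

```python
import math

def varimax_criterion(loadings):
    """Sum over factors of the variance of the squared loadings (raw varimax)."""
    p = len(loadings)
    k = len(loadings[0])
    total = 0.0
    for j in range(k):
        squared = [row[j] ** 2 for row in loadings]
        mean_sq = sum(squared) / p
        total += sum(s ** 2 for s in squared) / p - mean_sq ** 2
    return total

def rotate(loadings, theta):
    """Orthogonal rotation of a two-factor loading matrix by angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    return [[a * c + b * s, -a * s + b * c] for a, b in loadings]

def best_rotation(loadings, steps=900):
    """Grid search over [0, pi/2) for the angle maximizing the criterion."""
    angles = (i * (math.pi / 2) / steps for i in range(steps))
    theta = max(angles, key=lambda t: varimax_criterion(rotate(loadings, t)))
    return theta, rotate(loadings, theta)

# Hypothetical loadings: a simple structure deliberately rotated by 30 degrees.
simple = [[0.8, 0.0], [0.7, 0.0], [0.0, 0.6], [0.0, 0.9]]
unrotated = rotate(simple, math.radians(30))
theta, rotated = best_rotation(unrotated)
print(varimax_criterion(unrotated), varimax_criterion(rotated))
```

Because the rotation is orthogonal, each variable's communality (the sum of its squared loadings) is unchanged; only the distribution of variance across factors shifts.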


Multiple Regression Analyses 


The third step involved multiple re- 
gression studies with selected criteria of 
academically talented behavior as depen- 
dent variables. These criterion measures 
were GPA Teacher Evaluation, CAT Read- 
ing, CAT Language, CAT Arithmetic, 
STEP Social Studies, and STEP Science. 
Separate analyses were carried out for each 
sex (m, f) in each of the four communities 
(A, B, C, D). The cognitive and noncognitive
variables which showed promise as indicators
of talented behavior were retained
as dimensional or independent variables. In
addition, CTMM Mental Function and ISS
Family Status were added to represent differences
in intellectual functioning and in
family background, respectively, which were
shown to be involved in the interactions
found in the initial variance analysis.
Rhymes, another of the Guilford et al. 
(1951) factor tests to measure word fluency 
or verbal facility, also was included. By 
this time, the meanings of 46 kinds of de- 
scriptions age-mates made of one another 
in response to nomination items had been 
sorted out by factor analytic and factor 
matching techniques (Hindsman, 1960). 
Thus five sociometric factor variables com- 
mon to boys and girls were included in the 
regression studies: namely, Peer Accept- 
ance, Absence of Negative Model Value, 
Social Effectiveness, Nondeviant vs. Deviant
Behavior, and Quiet Dependency.


An iterative technique (Greenberger &
Ward, 1956) was employed for the multiple
regression analyses. The technique, which is
a modification of the Kelley-Salisbury
method, was adopted since the independent
variables were so numerous. Using the IBM
650 computer, iteration was carried out to
that point in the program where the sums
of squares of regressed values, R², were not
raised more than a specified criterion value
of .0005 in the analysis. The stop criterion
provided a solution which avoided overfitting
the regression line. This solution was
regarded as one which would have minimum
shrinkage upon application of the regression
weights to subsequent samples. Variables
with regression weights of zero at the conclusion
of the iterations were regarded as
linear combinations of those with weights.
Thus it was not necessary to specify in advance
the subsets of independent variables
which were linear combinations of the others.
This was regarded as an important justification
for the method employed.
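
The stopping rule described above can be sketched as follows. This is not the Greenberger and Ward program, whose details are not given in the text; it is a generic one-weight-at-a-time iteration on standardized variables, offered only to show how a criterion of .0005 on the gain in R² halts the process. The correlation values and the function name are hypothetical.

```python
def iterative_regression(R, r, stop=0.0005, max_iter=10000):
    """Iteratively adjust one standardized regression weight at a time.

    R    : correlation matrix among predictors (unit diagonal assumed)
    r    : correlations of each predictor with the criterion
    stop : iteration ends when the best available gain in R^2 is <= stop
    """
    k = len(r)
    weights = [0.0] * k
    r_squared = 0.0
    for _ in range(max_iter):
        # Residual validity of each predictor given the current weights.
        resid = [r[j] - sum(R[j][m] * weights[m] for m in range(k))
                 for j in range(k)]
        j = max(range(k), key=lambda i: abs(resid[i]))
        gain = resid[j] ** 2  # increase in R^2 from adjusting weight j
        if gain <= stop:
            break
        weights[j] += resid[j]
        r_squared += gain
    return weights, r_squared

# Hypothetical two-predictor problem.
R = [[1.0, 0.5], [0.5, 1.0]]
r = [0.6, 0.5]
weights, r_squared = iterative_regression(R, r)
print(weights, r_squared)
```

In this sketch a predictor whose residual validity never exceeds the criterion keeps a weight of zero, which parallels the treatment of zero-weight variables described above.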


RESULTS


The outcomes of the three studies 
are summarized in a series of tables 
which clearly show variability in test 
performances between boys and girls 
and from one community to another. 
Many of the findings in the variance 
analyses confirm what is already 
known. What is new largely involves 
the significant interactions. The factor 
analytic and multiple regression stud- 
ies demonstrate the consequences of 
the sex-typing of socialization pres- 
sures and the influences of school lo- 
cation, or community context, upon 
ways in which variables combine to 
map out different kinds of behavior 
and to explain certain kinds of talent 
valued in junior high schools. 


Variance in Test Performances 


Table 1 shows results of the anal- 
yses of variance for performances of 
144 subjects on the 22 cognitive test 
variables. In a similar manner, Table 
2 gathers together the results for the 
22 noncognitive measures. Instead of 
the usual mean squares, the entries
are intraclass correlation coefficients
which measure the resemblance or
average degree of similarity among
persons classified alike. For example,
in Table 1, the coefficient .49 in the
column for mental function (M) and
row for CAT Reading Achievement
has replaced a mean square significant
at the .01 point in the original analysis
of variance. Reference to tables of
means, not shown in this report to
conserve space, reveals mean stanine
values of 7.9 for HMF, 5.2 for AMF,
and 4.3 for LMF subsamples. The coefficient,
r_i = .49, represents the degree
to which 48 subjects in the same
category of mental function are alike
in performances on the standard test.
Each entry in the two tables may be
interpreted in a similar manner.
Cognitive tests. As one would expect,
Table 1 shows that, without exception,
mental function is a source of
variation in performances on achievement,
cognitive, and psychomotor
tests. DAT Clerical Aptitude (speed
and accuracy), Gestalt Completion
(visual figural recognition), and Dotting
(psychomotor speed), however,
appear only in interaction with other
components of variance. In general,
when the coefficients of resemblance
are low, as in the case of Mechanical
Reasoning and Discrimination Reaction
Time, the subjects in AMF and
LMF subsamples are more like one
another than they are similar to HMF
students whose performances exceed
their own.

Independently of mental function,
family status clearly is a source of
variation in some cognitive performances.
The HFS boys and girls have
the advantages of upper, upper-middle,
and middle-middle class family
backgrounds. They resemble one another
more than they do either the
AFS or LFS subjects in [...] aspects
of reading, in Mechanics of Eng-




TABLE 1 


SIGNIFICANT INTRACLASS CORRELATION COEFFICIENTS FROM ANALYSES OF VARIANCE IN
COGNITIVE PERFORMANCES OF 144 JUNIOR HIGH STUDENTS


Source of Variationᵃ


Cognitive Variable        S    F    M    L    Interaction
T Pending Achievement os**  49** 
acon id 05* 44** — 05* 
car prehension os** 39** 
Mane: Achievement 25** 41** = 09** 
ation of English ist* ogee 45** = 13** 
CA’ fe ing ; 18** 29** 
au Arithmetic Achievement 40** — 20** 
ee og** 31** = SFML = 36* 
8 soning 
TEP Rostal Studies ae FML = 22* 
Tighene o5*  —-:19** FML = 29** 
DAT Stening 31** FML = 24* 
ieee 34** SFML = 31* 
Voom Reasoning 19** 1is* 
etal, comietion o7* 49** —O7* 
oa 1g FML = 31** 
ords bad 
Shont *yComPletion SML = 19* 
Copyin ords 16** 15** o7* 
Potting 10e* = 14** SML = 21* 
7 =I" 
RT Reaction Time a 07* acl 
eee within group, n 72 48 48 36 
1 2 2 3 


Note. Rounded and decimal points omitted.
ᵃ Sex role (S), family status (F), CTMM mental function (M), school location (L); number of subjects in each group: ML = 12, FML = 4, SML = 6, SFML = 2.
* p < .05.
** p < .01.


lish, in solving [...] problems, and
in copying figures accurately.

Sex role apparently has a moderating
influence upon certain kinds
of cognitive test performances in the
[...] of junior high school. Girls
resemble one another and excel boys
in [...] aspects of language achievement;
in the speed and accuracy with
which they cope with clerical tasks;
in identifying short four-letter words
embedded in rows of letters (speed of
perceptual closure); and in psychomotor
speed measured by a dotting
test. In addition, the girls appear to
have somewhat greater verbal comprehension,
as measured by Vocabulary
Completion, and Discrimination
Reaction Time than the boys in the
seventh grade. On the other hand,
boys outperform girls in reasoning
through pictorially presented mechanical
situations.

Differences in test performances 
traceable to varying patterns of edu- 
cational experiences from one commu- 
nity to another are most apparent 
in the case of achievement in arithme- 
tic. In terms of the sample populations 
drawn, subjects from the two Gulf 
Coast communities, C and D, excel 
those of the North Central cities, A 
and B. The variability between pairs 




of locations appears to be a function 
of mastery of arithmetic fundamen- 
tals, not ability to reason numerically. 
In language achievement, subjects in 
Community C have the highest mean 
scores and those in A the lowest, B 
and D being similar. The differences 
may be traced to the relative mastery 
of the mechanics of English language 
by students in the four locations. 
Moreover, the small but significant 
variations from place to place in speed 
of perceptual closure, as measured by 
Short Words, parallel the perform- 
ances on language tests. On the other 
hand, the communities rank C, A, D, 
B on the two vocabulary test variables.

Each of the significant interactions 
in Table 1 involves school location as 
one of the components of variance, 
and four of the nine reflect sex-typed 
influences. In two instances, mastery 
of the fundamentals of arithmetic and 
clerical speed and accuracy, pairs of 
boys and girls with similar intelligence 
and family backgrounds in the same 
community (SFML) resemble each 
other significantly. Performances on 
the three STEP tests, Social Studies, 
Science, and Listening (cognitive ap- 
prehension), as well as Gestalt Trans- 
formation (conceptual redefinition) , 
vary from community to community 
according to family background and 
quality of mental function (FML) of 
the junior high students. Aspects of 
perceptual closure, measured by Ges- 
talt Completion and the Copying test, 
also vary from location to location, 
but this time in terms of sex-typed 
learning experiences and intellectual 
functioning (SML). Finally, psychomotor
speed, measured by the Dotting
test, has a different relation to level
of mental function from one place to
another (ML) independently of the
tendency of girls to excel boys in the
four communities.




Noncognitive tests. The analyses of
noncognitive performances summarized
in Table 2 refer only to self-report
or paper-and-pencil tests. Responses
of seventh grade students to
noncognitive instruments apparently
are less frequently influenced by
variations in intellectual functioning
than performances on cognitive tests.

The 48 subjects in the average or AMF
subsample resemble one another in
having more nervous tension (JPQ 2)
and reported anxiety symptoms (CMAS)
than those of either high or low
intelligence. Thus the relation of
these two variables to mental function
(CTMM) is curvilinear rather than
linear. The relation of a preference
for authoritarian discipline (CYS) to
intelligence also is somewhat
curvilinear, mean stanine values being
4.6 for the HMF, 5.4 for AMF, and 5.2
for LMF subsamples.
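The curvilinear pattern in these three means can be checked with simple trend contrasts. The sketch below is illustrative only: it assumes the three intelligence levels are equally spaced, uses the mean stanine values quoted above, and describes the shape of the means without testing significance. The function name `trend_contrasts` is hypothetical.

```python
# Mean stanine values for preference for authoritarian discipline (CYS),
# taken from the text; equal spacing of the three levels is an assumption.
means = {"LMF": 5.2, "AMF": 5.4, "HMF": 4.6}


def trend_contrasts(low, mid, high):
    """Return (linear, quadratic) contrasts for three equally spaced levels."""
    linear = high - low                 # contrast weights (-1, 0, +1)
    quadratic = low - 2 * mid + high    # contrast weights (+1, -2, +1)
    return linear, quadratic


linear, quadratic = trend_contrasts(means["LMF"], means["AMF"], means["HMF"])
print(f"linear = {linear:+.1f}, quadratic = {quadratic:+.1f}")
```

A quadratic contrast that is not close to zero indicates that the middle group departs from the straight line joining the two extreme groups, which is the sense of "somewhat curvilinear" here.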

Only in [illegible] instances does
family status have an independent
effect upon distributions of scores
obtained from the self-reports. The
subjects from MFS homes, lower-middle
and mobile upper-lower class, resemble
one another in having less independent
dominance (JPQ 9) than the 48 subjects
in either the HFS or LFS subsamples.
As expected, the lower class boys and
girls in the LFS subsample tend to
express a negative orientation to
society [illegible].

A number of sex-typed differences
may be noted [illegible] the sample
population represent themselves as
being [illegible] resemble one




TABLE 2


SIGNIFICANT INTRACLASS CORRELATION COEFFICIENTS FROM ANALYSES OF VARIANCE
IN NONCOGNITIVE PERFORMANCES OF 144 JUNIOR HIGH STUDENTS


Source of Variation* 


Noncognitive Variable 


s F M & Interaction 
JPQ 1 Sensitivity/Toughness 48** - > 
2 Tension/Relaxation or" eae 
3 Emotional/Stable 
4 Control/Casualness 
5 Impatient Dominance 06* 
6 Sociable/Withdrawn 
7 Adventurous 
8 Socialized Morale 20** 
0 a parole Dominance 13** 05" _ 
i apemen Conformity 
cys a: urgency/Desurgency 24e* 
amily Tension SFML = 62** 
Negative Social Orientation 10** 
pe uloritertan Discipline 06* 
as Maladjustment SFML = 40* 
riticism of Education aah = 
Criticism of Youth 
Social Inadequacy a aoe 
Self sini = 2 
elf Inadequacy ML = 80" 
oa eK 
Solas Anxiety OOF nie 
7 Scholastic Motivation a oP 
Teacher Valuation 05* ae 
z. een 
Subjects within group, 7 72 48 48 ee 
1 2 2 8 


Note.—Rounded and decimal points omitted.
a Sex role (S), ISS family status (F), CTMM mental function (M), school location (L);
number of subjects in each group, SL = 18, FL = 12, SFM = 8, FML = 4, SFML = 2.
* p < .05.
** p < .001.


another more than girls in being
tough-minded, more serious or desurgent,
reacting negatively to learning tasks
and school authority, independent,
impatient, and not so appreciative of
their teachers. Sex role, family status,
and mental function (SFM) combine
to influence responses to the CYS
scales for criticism of education and
feelings of self-inadequacy. In terms
of the self-reports, the eight lower
class girls of high intelligence, two
from each community, are least
critical and feel least inadequate. Most
critical of the school and its
expectations are boys of low intelligence from
lower-middle and mobile upper-lower
class families. Feelings of
self-inadequacy are highest among lower class
boys who also are low in mental
function.

School location is an independent 
source of variation in only one set of 
responses to the noncognitive instru- 
ments; namely, independent domi- 
nance which is highest among the 36 
subjects in Community C and lowest 
in D. On the other hand, 9 of the 11
significant interactions in Table 2
involve variability from one community
to another. The most interesting ones
are representations of social inade- 
quacy (SL, FML), self-inadequacy 
(FML), and family tension (SFML), 
all departing from zero at the .01 level 
of confidence. For example, the 18 
boys in Community B express the 
greatest social inadequacy (SL). The 
least social and self-inadequacy on the 
CYS scales is represented by the four 
lower class subjects of high intellec- 
tual calibre in Community A (FML). 
Pairs of boys or girls with similar 
level of mental function and fam- 
ily status in the same community 
(SFML) have a remarkable resemblance
in CYS scores for family tension
(r = .62).
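For readers unfamiliar with the statistic, an intraclass correlation of this kind can be obtained from the mean squares of a one-way analysis of variance. The sketch below is a generic estimator for equal-sized groups, offered as an assumption about the general approach rather than as the exact computation used in the study; the function name and the score pairs are hypothetical.

```python
def intraclass_r(groups):
    """Intraclass correlation from a one-way ANOVA with equal-sized groups.

    Uses (MSB - MSW) / (MSB + (k - 1) * MSW), where k is the group size.
    """
    n = len(groups)        # number of groups (for example, pairs of subjects)
    k = len(groups[0])     # subjects per group
    grand_mean = sum(sum(g) for g in groups) / (n * k)
    ss_between = k * sum((sum(g) / k - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - sum(g) / k) ** 2 for g in groups for x in g)
    ms_between = ss_between / (n - 1)
    ms_within = ss_within / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Hypothetical stanine scores for four pairs of subjects (not data from the study).
pairs = [(4, 5), (6, 6), (2, 3), (7, 8)]
print(round(intraclass_r(pairs), 2))
```

The coefficient approaches 1 when members of the same group have nearly identical scores and falls toward or below zero when scores within a group differ as much as scores from different groups.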

None of the cognitive test variables 
and only six of the noncognitive in- 
struments remain uninfluenced by one 
or more of the components of variance 
included in the research design. They 
are JPQ 3, 4, 6, 7, and 10, together 
with CYS values for criticism of 
youth. Of these, only JPQ 3 (emotionality
vs. stability) and 6 (sociable vs.
withdrawn) later proved to be promising
indicators of talented behavior
(McGuire, Hindsman, Jennings, & 
King, 1961). As a main effect or in an 
interaction, there are sex-typed dif- 
ferences in performances on 12 of the 
22 cognitive tests and on 13 of the 
22 noncognitive measures. Similarly, 
there are either independent or complex
variations in responses from one
community to another for 15 of the 
cognitive and 9 of the noncognitive 
variables. 


Factor Analytic Studies 


Table 3 presents a sample of perti- 
nent loadings for cognitive and non- 
cognitive variables from the factor 
analytic studies employed to sort out 
abilities and other attributes of the 
seventh grade population and a number
of subpopulations.* The table is
arranged so that magnitudes of loadings
for the two sex roles (m, f) and
among the four communities (A, B,
C, D) may be compared for Factor B
(academic grades) and Factor D
(socially oriented achievement
motivation). The number of subjects in
each analysis is entered at the foot
of each column in the table. By
convention, loadings of .30 or greater
are entered for the variables defining
each factor. The letter a denotes the
highest loading of a variable for the
population studied.
Sex roles. As shown in Table
(deposited with ADI), the
paper-and-pencil test responses and
teacher evaluations of the 1,417 junior
high school students, 772 males and 645
females, can be mapped out into seven
factors. Factor matching of loading
magnitudes (Cattell, 1957, pp.
[illegible]-827) indicates a reasonably
good agreement (correlations of
[illegible] or greater over all
variables) in the case of five of the
seven factors for boys and girls. The
factors common to both sexes have been
provisionally named Scholastic
Achievement (largely defined by
standard tests of achievement),
Academic Grades (reflecting the teacher
evaluations of performances in the
seventh grade), Divergent Thinking
(three of the four proposed tests of
creativity), Ineffectively Functioning
Personality (self-reports of
maladjustment, inadequacy, tension,
and anxiety), and Social




TABLE 3
[illegible] OF COGNITIVE AND NONCOGNITIVE VARIABLES FOR GRADE
POPULATION, BY SEX ROLES, AND COMMUNITIES


mais Grade Sex Roles Communities 
vil m f A B fe) D 
: Factor B: Academic Grades 
PA Teach i 
T er Evaluation Qa a a 7 
Toe Social Studies = Ena oa Loa im > i 
sailor dics | eel oe ee | — le 
t Mathemati a 3 i = f ee 
oan Arithmetic re s oF ce Ge ae as ioe 
CAT Language = go | 33 | — | 32 | Gl 
P Social Studies a _ “8 = = a 
Reading = ~ _ _ aa ae 42 
Daw pene oll ban bee =e ae Pen ae 
3hG.g eee ine an =n Pe eee ee bd 
3 Emotional/Stable = = = 31 ~~ _ aoe 
: Factor D: Socially Oriented Achievement Motivation 
PQ 8 Sociali 
alized Morale 
9 : 715 729 75 a a 
Jpg 3 Taepeldent Dominance —578 | —525 | —529 _ 2 a ie 
SSHA at ree rend 7530 | —4g= | —52° | —35° | —Go | —47* | —52° 
IPQ 6 So olastic Motivation 47 485 4g 38° 399 380 62" 
Gi te ee gas | 40°) 39° | 40e| 38° | 42°) 36° 
me ines = = = HS | Se 
IG is ee so | — | — | oo 2S) a) 
CYS N Surgent/Desurgent — | —36= | —36 _ = = = 
Tee Social Orientation = — — — | -44 — | -382 
eneh eacher Evaluation - — — — 31 — — 
Ys a Social Studies - = _ = 30 = 
s Ree Inadequacy — — = = 638 | — = 
ST aladjustment = = a — | om) = = 
‘amily Tension = e — — | cee | — = 
Po i 
pulation, N 1,417] 772 | 645 | 338 | 319 | 448 264 


Note.—Decimal points omitted.
a Highest loading for a variable in a factor structure.


Alienation ([illegible] and anti-[illegible]).
The two factors with least agreement
between the sex roles turn out to be
Socially Oriented Achievement
Motivation (acceptance of school and
[illegible] conformity, stability,
[illegible] scholastic motivation) and
Perceptual Closure. Factor loadings
for the former, Factor D, are shown in


Table 3. Girls who have a positive 
attitude toward school and academic 
attainments are less anxious than boys 
and appear to be able to be critical of 
educational practices whereas their 
male counterparts are not. Elements 
common to both sexes on the percep- 
tual factor involve ability to cognize 
symbolic units (Mutilated Words, 
Short Words, DAT Clerical). For girls,
this ability is linked with achievement
on a language test; for boys, symbolic 
closure is tied in with visual figure 
recognition and psychomotor speed. 
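The factor matching mentioned above compares two factors by correlating their loadings over the same set of variables. The sketch below shows that idea with a plain product-moment correlation; Cattell's published procedure may differ in detail, and the loadings and the function name `pearson` are hypothetical illustrations, not values from Table 3.

```python
from math import sqrt


def pearson(x, y):
    """Product-moment correlation between two equal-length lists of loadings."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical loadings of the same five variables on one factor,
# estimated separately for boys and for girls.
boys = [0.75, -0.58, 0.47, 0.40, 0.05]
girls = [0.72, -0.52, 0.48, 0.39, -0.10]

match = pearson(boys, girls)
print(round(match, 2))
```

A coefficient near 1 would indicate that the two analyses recovered much the same factor; a low coefficient would indicate that the factor is defined by different variables in the two groups.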

Community variability. With one 
exception, Social Alienation, the fac- 
tors mapped out for the total popula- 
tion fail to match across the four 
communities. The factor structure for 
Community C, which had 488 boys 
and girls in departmentalized class- 
rooms of two junior high schools, is 
most like that of the total population, 
In the other school locations, Percep- 
tual Closure is absorbed either by 
Divergent Thinking (A and D) or by 
Scholastic Achievement (B). Com- 
munity A, which had 338 students in 
modified self-contained classrooms of 
one large junior high, otherwise shows 
a pattern which is reasonably similar 
to C and the total population. In Com- 
munity D, where 264 pupils were 
housed in self-contained classrooms 
of county schools as well as in a 
smaller departmentalized junior high 
school, Academic Grades appear to be 
more closely associated with perform- 
ances on standard tests of achievement 
than in Locations A and C. 

The factor analysis for Community 
B, where seventh grade students were 
housed in self-contained classrooms at 
five different elementary schools pend- 
ing the completion of another junior 
high building, yields a sorting of abili- 
ties and other attributes quite different 
from Locations A, C, and D. Only 
four factors are necessary to map the 
paper-and-pencil test responses and 
teacher evaluations of the 319 boys 
and girls. High loadings for the stand- 
ard tests of achievement and for 
teacher evaluations combine with two 
measures of divergent thinking (See- 
ing Problems, Consequences) and in- 
dicators of perceptual closure to form 

a composite factor which might be
called [illegible] Attainment. Thus for
Factor B (Academic Grades), there
are no loadings shown for Community
B in Table 3. Factor D (Socially
Oriented Achievement Motivation) in
Table 3 for Community B absorbs
what might be termed “effectively
vs. ineffectively functioning
personality,” a polar arrangement of JPQ,
CYS, and similar noncognitive scales.
Only Social Alienation remains similar
to the other communities.


Regression Studies 


Table 4, which indicates the regressions
of CAT Language Achievement in Grade 7
upon certain cognitive and noncognitive
variables by sex role in each of the
four communities, serves to illustrate
the results of the regression studies.
Only the beta weights of variables
which illustrate between sex and cross
community variability effectively are
shown. Similar variability of beta
weights for variables between the sexes
from community to community occurs
when the criterion measures are CAT
Reading, CAT Arithmetic, STEP Social
Studies, STEP Science, and GPA Teacher
Evaluations. When GPA Teacher
Evaluation is the dependent variable,
the sociometric factor variables and
some of the noncognitive indicators of
talented behavior have higher loadings
than they do for standard tests of
achievement used as criteria. Although
the loading magnitudes for the set of
variables vary with each crite-


‘ 

A set of six tables (Tables B to G)
showing regressions of CAT Reading, CAT
Language, CAT Arithmetic, STEP Social
Studies, STEP Science, and GPA Teacher
Evaluation by community and sex role has
been deposited with the American
Documentation Institute. Order Document No.
6610 from ADI Auxiliary Publications
Project, Photoduplication Service, Library
of Congress; Washington 25, D. C., remitting
in advance $1.25 for microfilm or $1.25
for photocopies. Make checks payable to:
Chief, Photoduplication Service, Library of
Congress.




TABLE 4


REGRESSIONS OF CAT LANGUAGE ACHIEVEMENT IN GRADE VII UPON COGNITIVE
AND NONCOGNITIVE VARIABLES BY COMMUNITY AND SEX ROLE


Note.—Rounded to two places, decimal points omitted in lower portion of table.
a Beta weights for all 32 variables are shown in Table C, deposited with the American Documentation Institute.


rion, the multiple correlation
coefficients are uniformly high and range
from .75 to .86 (McGuire et al., 1961).
The important point to be noted in
Table 4, and in the complete Tables
B to G (deposited with ADI), is that
the set of independent variables
which yields the maximum multiple
correlation with the criterion measure
is not the same from one community
to another nor for males and females
at each of the school locations. In
each column of the complete tables,
the omitted variables (which have
beta weights of zero) can be regarded
as linear combinations of those with
loading magnitudes for the particular
subpopulation. Thus the regression 
Studies supply further evidence that 
the sex-typing of socialization pres- 
sures and the context of educational 
experiences apparently do have quite 
an influence upon ways in which cog- 
nitive and noncognitive variables rep- 


Community 
Variable® x 3 é = 
ae m f m f m f m f 
pone correlation, gi | .s2 | .82 | .84 | .8L | -83 | .75 81 
ple population, V 161 | 157 94 | 112 | 266 | 241 | 113 “98 
Cc / = = 
are Mental Function 39 30 28 17 33 20 41 29 
Conte ichanical o7 | os | 20 | —06 os | 19 | —12 
ee Transformation —08 21 06 06 08 
che wee —05 | —06 05 | —os | —12 | —09 11 | —12 
t otting s iS iu = 03 06 10 08 11 
i nm i 7 “N 
IPe 1 Sensitivity 07 05 | —06 a « 12 0 . 
PO 1 Social Morale —-13 03 —03 —08 ll 
SSH Ig beet ae oe o4 | 12 —03 —07 | 16 
No teetlastie Motivin | 05 | 09 | 19 05 | i | -i7 
Iss Todel Value 12 14 | —10 18 
'S Family Status 05 20 —14 
ion of table. 


h the American Documentation Institute. 


resenting potentialities, expectations, 
and pressures combine to explain vari- 
ous kinds of valued behavior. 
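The relation between beta weights and the multiple correlation discussed in this section can be illustrated with the closed-form solution for two standardized predictors. This is a minimal sketch under stated assumptions: the study used 32 variables and an iterative technique, whereas the example below uses only two predictors, and the correlations and the function name `two_predictor_regression` are hypothetical.

```python
from math import sqrt


def two_predictor_regression(r_y1, r_y2, r_12):
    """Beta weights and multiple R for a criterion regressed on two predictors.

    r_y1, r_y2: correlations of each predictor with the criterion.
    r_12: correlation between the two predictors.
    """
    denom = 1.0 - r_12 ** 2
    beta1 = (r_y1 - r_y2 * r_12) / denom
    beta2 = (r_y2 - r_y1 * r_12) / denom
    multiple_r = sqrt(beta1 * r_y1 + beta2 * r_y2)
    return beta1, beta2, multiple_r


# Hypothetical correlations, not values from Table 4.
beta1, beta2, multiple_r = two_predictor_regression(r_y1=0.6, r_y2=0.3, r_12=0.5)
print(round(beta1, 2), round(beta2, 2), round(multiple_r, 2))
```

In this example the second predictor correlates .30 with the criterion yet receives a beta weight of zero, because everything it contributes is already carried by the first predictor. This is the sense in which an omitted variable can be regarded as redundant with those retained for a particular subpopulation.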


Discussion 


The series of variance, factor ana- 
lytic, and regression studies of sex 
role and community variability in 
test performances, as well as teacher 
evaluations and peer appraisals, con- 
firm some well-known findings and 
raise questions about others. For ex- 
ample, the variance analyses of sev- 
enth grade data fail to support the 
generally held conclusion that females 
excel in numerical computation and 
males are superior in mathematical 
reasoning and in science (Anastasi,
1958, pp. 452-504; Tyler, 1956, pp.
247-275). Although the reputed
aggressiveness of boys can be read into
their greater tough-mindedness,
independence, and negative reactions to
learning tasks and school authority,
the seventh grade males are more
serious (desurgent) than their female 
counterparts in this study. The higher 
emotional sensitivity and excitability 
(surgency) of the girls might be in- 
terpreted as symptoms of neuroticism 
and instability, but this is belied by 
their tolerance, socialized morale, and 
restraint. 

Taken together, the series of studies 
clearly support the proposition that 
sex role and school location have a 
moderating influence upon the per- 
formances of junior high school stu- 
dents on cognitive and noncognitive 
instruments. In the variance analyses 
more than half of the responses ap- 
pear to be biased by whatever is in- 
volved in learning a male or a female 
sex role. The consequences are more 
evident in the regression than the 
factor analytic studies, where five of 
seven factors showed a reasonable 
match between the sexes. Within com- 
munities, however, the differences be- 
tween males and females are quite 
marked where the same criterion 
measure regresses on different sets of 
independent variables with fluctuat- 
ing beta weights. 

Similarly, more than half of the 
distributions for cognitive tests, and 
nearly half for the noncognitive jn- 
struments, vary from one school loca- 
tion to another. These variations are 
illustrated most clearly by the differ- 
ences among communities in the mas- 
tery of fundamentals of arithmetic 
and mechanics of English. More im- 
portant, however, are the frequent 
interactions of school location with 
other components of variance. They 
would lead one to infer not only that 
patterns of educational experiences 
differ but also that more subtle in- 
fluences are being exerted by the four 
community contexts. These inferences 
are borne out both by the differences 
in factor structure, which parallel
degree of departmentalization or
organization in terms of self-contained
classrooms, and by the regression
studies.
One of the unexpected findings is
the frequency with which life style is
independent of intelligence in
influencing behavior on cognitive tests,
especially standard measures of
achievement. The most probable explanation
of the additive relation between social
class and mental function is that
family status is an index of the kinds
of motivation and socialization
pressures which maximize experiences
involving reading, the effective use of
English, and concern about scientific
knowledge. This interpretation
parallels a finding in another study (McBee
& Duke, 1960) that intelligence and
SSHA scholastic motivation have
independent effects upon achievement
in arithmetic, reading, and science,
not language and social studies.
The foregoing findings posed a very
difficult problem for the research team.
They suggested that the sex-typing of
socialization pressures and the
influences of school location upon ways
in which the proposed classes of
variables (potentialities, expectations,
pressures) combined were more
important than first postulated. If so,
[illegible] combination of indicators to
[illegible] and predict talented behavior
[illegible] males and [illegible] be worked
[illegible]. The alternative [illegible] to estimate
underlying factors in persons (Cattell,
1957, pp. 287-296; Guilford, 1954, pp.
[illegible]-526) to attain what Guilford
terms “a much more standard [illegible]
erms “a much more standard eile 
A series of variance, factor analytic,
and regression studies [illegible] the
proposition that sex role and school
location have a moderating influence
upon the performances of junior high
school students on cognitive and
noncognitive instruments [illegible] as
potential indicators of talented
behavior. Using a factorial design which
involved 144 boys and girls in four
communities in their seventh grade
year, analyses of variance were carried
out for 22 cognitive and 22
noncognitive variables. These analyses
demonstrated many expected and
unanticipated main effects of sex role,
family status, mental function, and
school location. Moreover, they showed
20 significant interactions, 18 of which
involved variations in test behavior
from one community to another. Then
the [illegible] of sex role and
community variability in test
performances first identified in the
variance analyses were demonstrated in
factor analytic studies and multiple
regression analyses.

The regression analyses revealed that
various kinds of behavior valued in
junior high schools could be explained
with multiple correlations ranging from
.75 to .86 by varying combinations of
cognitive and noncognitive variables.
But these sets of independent variables
to be used as predictors were different
for each community and for boys and
girls within each school location. This
development [illegible] the finding
that factor structures varied for males
and females, and even more so from one
community to another.


REFERENCES

Anastasi, Anne. Differential psychology.
(3rd ed.) New York: Macmillan, 1958.

Auld, F., Jr. Influence of social class on
personality test responses. Psychol. Bull.,
1952, 49, 318-332.

Castaneda, A., McCandless, B. R., &
Palermo, D. S. The children's form of the
Manifest Anxiety Scale. Child Develpm.,
1956, 27, 317-326.

Cattell, R. B. Personality and motivation
structure and measurement. New York:
World Book, 1957.

Ferguson, G. A. On learning and human
ability. Canad. J. Psychol., 1954, 8,
95-112.

Goethals, G. W. A framework for
educational research. Harv. educ. Rev., 1958,
28, 29-43.

Greenberger, M. H., & Ward, J. H., Jr. An
iterative technique for multiple
correlation analysis. IBM tech. Newsltr., 1956,
No. 12, 85-97.

Guilford, J. P. Psychometric methods. (2nd
ed.) New York: McGraw-Hill, 1954.

Guilford, J. P., Wilson, R. C., Christensen,
P. R., & Lewis, D. J. A factor-analytic
study of creative thinking: I. Hypotheses
and description of tests. Rep. Psychol.
Lab., U. Southern Calif., 1951, No. 4.

Hindsman, E. Dimensions of adolescent
behavior. Unpublished doctoral
dissertation, University of Texas, 1960.

Hindsman, E., & Duke, R. L. Development
and utilization of talent. J. teacher
Educ., 1960, 9, 107-116.

Kaiser, H. F. The varimax criterion for
analytic rotation in factor analysis.
Psychometrika, 1958, 23, 187-200.

McBee, G., & Duke, R. L. Relationships
between intelligence, scholastic
motivation, and academic achievement.
Psychol. Rep., 1960, 6, 3-8.

McGuire, C. Family and age-mates in
personality formation. Marriage fam. Liv.,
1953, 15, 17-28.

McGuire, C. Talented behavior in junior
high schools. Final Report, 1960,
University of Texas, Project 025,
Cooperative Research Program, Office of
Education, United States Department of
Health, Education, and Welfare.

McGuire, C., Hindsman, E., Jennings, E., &
King, F. J. Dimensions of talented
behavior. Educ. psychol. Measmt., 1961,
21, 3-38.

McGuire, C., & White, G. D. The
measurement of social status. Res. Pap. hum.
Develpm., 1955, No. 3.

Moore, Bernice M., & Holtzman, W. H.
What Texas knows about youth. Nat.
Parent-Teacher, 1958, 53, 22-24.

More, D. M. Developmental concordance
and discordance during puberty and
early adolescence. Monogr. Soc. Res.
Child Develpm., 1953, 18(1, Ser. No.
56).

Tyler, Leona E. The psychology of human
differences. (2nd ed.) New York:
Appleton-Century-Crofts, 1956.

(Received May 31, 1960)

Journal of Educational Psychology 
1961, Vol. 52, No. 2, 74-79 


SIGNIFICANT IQ CHANGES IN TWENTY-FIVE YEARS:
A FOLLOW-UP¹


KATHERINE P. BRADWAY
Stanford University

AND

NANCY M. ROBINSON²
Chapel Hill, North Carolina


The following constitutes a report
of several statistical analyses of
intelligence test and family background
data obtained in 1931, 1941, and 1956,
on 110 members of the original
preschool standardization sample of the
1937 Revision of the Stanford-Binet.³
An initial report of the study has
appeared previously (Bradway, Thompson,
& Cravens, 1958).

In 1931, 212 children in the San
Francisco Bay Area, aged 2 to 5½,
were given tests which later
constituted Forms L and M of the 1937
Stanford-Binet. Careful methods of
selection were adhered to in selecting
these subjects, as they were the
California sample of the nationwide
standardization of this revision (McNemar,
1942; Terman & Merrill, 1937). Ten
years later, the senior author
administered Form L of the Stanford-Binet
to the 138 of these children who still
remained in the area.

In 1956, 110 of those tested in 1931
and 1941 were located and given both
Form L of the 1937 Stanford-Binet
and the Wechsler Adult Intelligence
Scale. No interim contact had been
made between 1931 and 1941, or
between 1941 and 1956.


¹ This investigation was supported by
PHS Research Grant M-1273 (C2) from the
National Institutes of Health, United States
Public Health Service.

² Formerly at Stanford University.

³ The authors are grateful to Maud A.
Merrill and the late Lewis M. Terman for
generously having made their
standardization data available for follow-up; to Quinn
McNemar, consultant for both follow-ups,
who has been an invaluable advisor; and to
Clare Thompson for her several
contributions to the handling of data and to their
presentation.

An important aspect of the 1941
study involved those 50 subjects whose
test scores showed statistically
significant increase or decrease since the
original (1931) testing. Extensive
interviews with their parents were
conducted to assess the intellectual
stimulation in the home and to obtain
data on the intellectual level of
parents and grandparents. Significant
differences between the groups were
shown on an “ancestral index” and
related measures which estimated the
intellectual abilities of parents and
grandparents (Bradway, 1945).
Included in the 1956 testing were 21 of
the 26 subjects whose IQs had shown
the most increase between 1931 and
1941 (“1941 increase group”), and
21 of the 24 subjects whose IQs had
decreased most during that period
(“1941 decrease group”). It is these
42 subjects who figure most
prominently in the analyses to follow, which
were designed to determine whether
a change between preschool and junior
high school age is more likely to be
followed in adulthood by a change in
the same direction, the opposite
direction, or no change at all in the
individual's status compared to the rest of
the group. A further question involves
the previous finding that a significant
decrease in IQ was associated with a
significantly lower intellectual level in
the two previous generations than was
a significant increase in IQ.


SUBJECTS AND PROCEDURE


The subjects in the present study had
been administered Forms L and M of the
Revised Stanford-Binet Scale 25 years
previously, in 1931, in connection with its
standardization, and Form L of the same
scale in 1941 (Bradway, 1945). A comparison
of the 1941 retest group of 138 subjects
with the total 633 subjects of like age in the
original United States standardization group
showed [illegible] selection. The initial
mean IQ of the retest group was
109.9, compared with 105.4 for the
standardization group.

In selecting for further study the 54
subjects whose IQs had changed most, special
corrections were applied to the initial
composite IQs. These allowed for regression of
the [illegible] IQ due to errors of
measurement in the initial (1931) IQ, and
equated initial and retest IQs in relation to
[illegible] their distributions. The details
of these [illegible] have been reported by
Bradway (1945).

In obtaining information in 1941 about
home [illegible], not only were the 50
children interviewed, but also mothers of 47
and fathers of 3 of them. The mothers, in
addition, were given a vocabulary test of
intelligence and each child was given the
Woodworth [illegible] Questionnaire. Twenty-six
scores were obtained for a wide variety of
variables such as intellectual stimulation of
the home, intelligence of parents,
grandfathers' intelligence as estimated by the
[illegible] of Occupational Intelligence
(Brussell, 1932), happiness and
socialization of the household, and child's general
adjustment. The most consistent and marked
differences [illegible] in scores reflecting the
intelligence of parents and grandparents. An
index which gave weight both to parental
intelligence and grandfathers' occupational
intelligence produced a critical ratio of 3.32
(p < .001) between the groups, the 26
children in the “increase” group coming
from families with significantly higher index
values than the 24 children in the “decrease”
group.

The loss of 28 subjects between 1941 and
1956 resulted in further upward
selection with respect to initial (1931) mean
IQ, which was 111.1 for the 1956 group, or
[illegible] points higher than the mean for the total
standardization group of these ages. At the
time of the 1956 testing, subjects were
interviewed about their occupation, marital
status, education, and health. Information
about [illegible] education and grandfathers'
[illegible] was gathered by mail and is
available for 98 of the 110 subjects.
Ancestral indices were calculated using methods
similar to those employed by Bradway in the
previous study.


RESULTS 


Test Scores for 1941 Groups 


An earlier article (Bradway et al., 
1958) reported a correlation (r) of .59
between preschool Stanford-Binet IQs
and adult Stanford-Binet IQs obtained
25 years later; the correlation between
adolescent and adult IQs obtained in
1941 and 1956 for the same group was
.85. They also found that the obtained 
1956 Stanford-Binet IQs were higher 
than would be expected on the basis 
of the 1931 and 1941 IQs of this group, 
although the standard deviations of 
the distributions were similar. 

Presumably, intellectual growth 
does not cease by the age of 16 as had 
been assumed in the 1937 Revision 
(Terman & Merrill, 1937). This group 
had attained a mean mental age of 18.5 
years as adults, instead of 16.8 which 
would have been predicted from their 
adolescent IQs in the absence of men- 
tal growth beyond age 16 years. This 
finding is in correspondence with the 
reports of several recent investigators 
that intellectual growth continues well 
past adolescence (Bayley, 1955; Bay- 
ley & Oden, 1955; Miner, 1957; Owens, 
1953). To equate for this continued 
growth and to permit meaningful com- 
parisons between an individual’s sta- 
tus within the group at the two test- 
ings, the mean difference between the 
1941 and 1956 IQs (11.5 points) was 
subtracted from the obtained 1956 
Stanford-Binet IQs.⁴ The logic
underlying this correction is similar to that
upon which the use of the Deviation
IQ is based: namely, that the IQ is
intended as an index of the
individual's relative standing within his
age group, rather than as a measure
of his level of mental development per
se. There was no reason to suspect that
these particular subjects had
exhibited unusual mental growth which
would have elevated the mean IQ of
this sample above the mean IQ of the
general population; and their
Deviation IQs on the WAIS tended to
substantiate this conclusion. Using
standard scores would have been an
alternate way of equating the two sets
of scores, but since the standard
deviations of the two sets of IQs were
similar, equating the means by
subtracting the mean difference from each of
the 1956 IQs was sufficient to make
them comparable to the 1941 IQs as
indices of relative position within the
group.

⁴ A forthcoming article by Bradway and
Thompson will consider adult intelligence
and the problems of appropriate indices.
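The correction described above amounts to subtracting a constant from every 1956 score so that the two testings share a common mean. The sketch below illustrates the arithmetic; the 11.5-point difference is taken from the text, while the individual scores and the function name `corrected_iq` are hypothetical.

```python
# Mean difference between the 1956 and 1941 IQs, as reported in the text.
MEAN_DIFFERENCE = 11.5


def corrected_iq(obtained_1956_iq, mean_difference=MEAN_DIFFERENCE):
    """Return a 1956 IQ placed on the same footing as the 1941 IQs."""
    return obtained_1956_iq - mean_difference


# Hypothetical subject (not a case from the study).
iq_1941 = 110.0
iq_1956 = 125.0

# Change in relative standing after the group-wide gain is removed.
relative_change = corrected_iq(iq_1956) - iq_1941
print(relative_change)
```

Because the same constant is removed from every score, the standard deviation of the 1956 distribution is unchanged, which is why the approach is adequate only when the two distributions already have similar spread, as the authors note.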

The means and standard deviations 
of the resulting corrected IQs of the 
present sample are included in Table 
1, arranged according to the subgroups 
to which they were assigned in 1941. 
In this table are presented also means 
of the Stanford-Binets administered 
in 1931 and 1941 and of the WAIS 
administered in 1956 to the same 
groups. The respective mean chrono- 
logical ages at the times of these
three testings were 4.0, 13.6, and 29.5
years. 

It will be noted from Table 1 that 
for both 1941 change groups, the ob- 
served changes to 1956 are in the re- 
verse direction from the previous 
changes. It would seem that the group
whose rate of mental growth had
shown deceleration from preschool to
adolescence, however, had more nearly
reached a level of brightness by ado-
lescence that was largely maintained
into adulthood. Subtracting their 1941
IQs from their 1956 (corrected) IQs
yielded a t value for this decrease
group of 1.94, with a probability be-
tween .05 and .10. On the other hand,
the group whose mental growth had
shown acceleration between preschool
and adolescence had tended to lose
some of their relative gains by the
time they reached adulthood. The cor-
responding t value for this increase
group was -8.02, which is significant
at beyond the .01 level. Moreover, the
difference between the 1956 mean IQs
for the two groups is reliably less
than the difference between the 1941
mean IQs for the two groups as indi- 


TABLE 1 
MEAN IQ AND STANDARD DEVIATION OF 1941 SUBGROUPS FOR EACH TEST


1941 Subgroups 


™- a a Cl cr 

1931 Forms L, M 

Mean IQ* 112.8 108.8 111.3 117.1 

SD 21.0 12.8 15.1 15.9 
1941 Form L 

Mean IQ 97.3 127.9 111.7 112.0 

SD 17.2 11.0 13.7 16.8 
1956 Form L 

b 100.6 121.5 112.5 112 

- AQ (Corrs) 18.6 9.5 13.1 et 

1956 WAIS 108.8 
101.8 113.5 
Mean 1Q 14.0 7.6 9.1 pe 


a Average of two IQs minus one point to correct for practise effect.
b IQ minus 11.5 points to equate 1941 and 1956 group means.


IQ CHANGES OVER TWENTY-FIVE YEARS


cated by a t value of 3.59 which is
significant at beyond the .001 level.
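
The t values reported for the change groups come from subtracting each subject's earlier IQ from his later (corrected) IQ. A minimal sketch of such a paired comparison follows, assuming the usual difference-score formula; the function name `paired_t` and the scores are hypothetical, and the source does not say exactly how its t values were computed.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(earlier, later):
    """t for the mean of paired differences (later minus earlier)."""
    diffs = [b - a for a, b in zip(earlier, later)]
    standard_error = stdev(diffs) / sqrt(len(diffs))
    return mean(diffs) / standard_error


# Hypothetical scores for four subjects, not the study data.
earlier = [100, 110, 120, 130]
later = [98, 107, 119, 125]

t_value = paired_t(earlier, later)
print(round(t_value, 2))
```

A negative t here simply means that the later scores are lower on average than the earlier ones.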

Another index of the stability of the
[illegible] is provided by determining
whether, in fact, the groups remain
significantly different from each other,
as they were in 1941. Analyses of var-
iance of the 1956 data yielded an F
of [illegible] for the Stanford-Binet IQs
and of 7.40 for the WAIS Full
Scale IQs, with probabilities of <.001
and [illegible], respectively. The groups do,
therefore, remain significantly different

from each other.


Further Investigation of the “Ancestral Index”


In an effort to establish the general-
ity of the finding that ancestral intel-
ligence is related to the direction of
[illegible] mental growth, each subject
was asked by mail to indicate how
far each of his parents had gone in
school and the occupation of each of
his grandfathers. This information
was already available for those sub-
jects in the 1941 “change” groups, but
was not known for the 68 subjects who
had been members of the “no change”
group. The mean initial (1931) IQ of
the 98 subjects for whom these data
were available was 112.3. The mean
education level of their parents was
[illegible], which is comparable to the
mean education level of California
[illegible] according to the United
States Bureau of the Census (1950).
In the 1941 study, the weighted
index had been calculated on ranked
data for the 50 subjects in the
change groups, giving equal weight
to the following: (a) mothers’ intelli-
gence as ranked by interviewer
(largely based on Vocabulary test
score), (b) fathers’ intelligence as
ranked by interviewer (largely based
on occupation), and (c) average of
grandfathers’ occupational scores on
the Minnesota occupational classifica-
tion (Brussell, 1932). Ranks on each
variable were translated to scores on a 
normalized distribution according to 
a method suggested by Hull (Garrett, 
1932, p. 113). Because the data con- 
sist of ranks within this selected sam- 
ple only, the absolute scores cannot be 
translated directly to other groups. 
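
The translation of ranks into scores on a normalized distribution can be illustrated roughly as follows. This is an approximation in the spirit of the method cited, not the published procedure: it assumes ranks 1 to N without ties, with rank 1 as the highest, converts each rank to the proportion (R - .5)/N, and reads off the corresponding normal deviate with Python's `statistics.NormalDist` instead of a printed table. The function name is hypothetical.

```python
from statistics import NormalDist


def ranks_to_normal_scores(ranks):
    """Map ranks (1 = highest) to standard normal deviates.

    Each rank R among N cases is placed at the proportion (R - 0.5) / N
    of the distribution, counted from the top.
    """
    n = len(ranks)
    unit_normal = NormalDist()
    return [unit_normal.inv_cdf(1 - (r - 0.5) / n) for r in ranks]


# Five hypothetical cases ranked 1 through 5.
scores = ranks_to_normal_scores([1, 2, 3, 4, 5])
print([round(s, 2) for s in scores])
```

As the text notes, scores obtained this way describe position within the ranked sample only and cannot be carried over directly to other groups.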
A similar method was used in calcu-
lating the ancestral index for the 1956
data. In this case, however, the rank 
order of mothers’ intelligence was es- 
timated solely on the basis of educa- 
tion, an index admittedly inferior to 
Vocabulary test score, but the best 
index available to us. A rank order 
correlation of .72 was found between 
education and interviewer estimates 
(largely based on Vocabulary score) 
for the 40 subjects in the deviation 
groups for whom both figures were ob- 
tained in 1941. Fathers’ intelligence 
was estimated by education and Min-
nesota occupational classification,
equally weighted. Grandfathers’ oc-
cupations were classified as before.
Each rank was translated into a score
according to the Hull method, and the
scores were combined to give equal
weight to mothers’, fathers’, and
grandfathers’ (combined) scores.
For the 28 subjects in the “change”
groups who returned the questionnaire,
a correlation of .70 was obtained be-
tween ancestral indices based entirely

on the 1956 information, and those 
based on 1941 information. Since it 
seemed probable that the accounts by 
the mothers were more accurate than 
those given 17 years later by the 
younger generation, the 1941 informa- 
tion was utilized wherever possible. 
The resulting indices were then cor- 
related with each set of test data, as 
shown in Table 2. Whatever the na- 
ture of the relationship between ances- 
tral index and intelligence, it appears 
that the main effects have occurred 
by the time of adolescence. The cor- 




TABLE 2
CORRELATION OF IQs WITH ANCESTRAL INDEX
(N = 98)

Test | r | r/sigmaᵃ
1931 Combined Forms L, M | .14 | 1.38
1941 Form L | .29 | 2.86**
1956 Form L | .23 | 2.27*
1956 WAIS-Full Scale | .32 | 3.15**
1956 WAIS-Verbal | .31 | 3.05**
1956 WAIS-Performance | .29 | 2.86**

ᵃ Standard error of the correlation is .1015 when N is 98.

* Significant at .05 level.

** Significant at .01 level.


relation of the ancestral index with 
IQ did not quite reach statistical sig-
nificance (p > .1) in 1931 (pre-
school level); it was significant in
1941 (adolescent level); it remained 
significant, and without appreciable 
change, in 1956 (adult level). The 
t tests of the differences between these 
related correlations were calculated 
according to a method suggested by 
McNemar (1955, p. 146). The differ- 
ence between correlations of the ances- 
tral index and test scores obtained in 
1931 and 1941 reached a borderline 
level of significance (p = .07), but 
none of the correlations obtained in 
1941 and 1956 differed significantly 
from each other. 
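
The r/sigma column of Table 2 can be checked with a short calculation. The sketch below assumes that the standard error was taken as 1/sqrt(N - 1), which reproduces the footnoted value of .1015 for N = 98; the source does not state the formula, so that is an inference, and the function names are illustrative.

```python
from math import sqrt


def null_standard_error(n):
    """Standard error of r under the hypothesis of zero correlation."""
    return 1.0 / sqrt(n - 1)


def critical_ratio(r, n):
    """Ratio of a correlation to its standard error (r/sigma)."""
    return r / null_standard_error(n)


# Correlations as listed in Table 2 (N = 98).
table_2 = {
    "1931 Combined Forms L, M": 0.14,
    "1941 Form L": 0.29,
    "1956 Form L": 0.23,
    "1956 WAIS-Full Scale": 0.32,
    "1956 WAIS-Verbal": 0.31,
    "1956 WAIS-Performance": 0.29,
}

ratios = {test: round(critical_ratio(r, 98), 2) for test, r in table_2.items()}
for test, ratio in ratios.items():
    print(f"{test}: {ratio}")
```

Ratios above about 1.96 and 2.58 correspond to the .05 and .01 levels marked in the table.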


Discussion 


Our data indicate that mental 
growth continues into adulthood (as 
others have reported). More specifi- 
cally, however, we found that a de-
celeration of mental growth rate from
preschool to adolescence is not likely 
to be followed by further (permanent) 
changes in growth rate up to adult- 
hood. On the other hand, acceleration 
of mental growth rate from preschool 
to adolescence is more likely to be 
followed by deceleration (of a much 
smaller degree than the previous ac- 


celeration) than by further accelera- 
tion or by no change at all. 

The finding that the factors re-
lated to the rate of mental growth
had their greatest effect during the
earlier years of life is consistent with
findings of the several other longitudi-
nal studies which have correlated var-
ious parental and environmental fac-
tors with variation in children’s IQ.
For example, Bayley (1940) in the
Berkeley Growth Study and Honzik
(1940) in the Berkeley Guidance
Study reported that the correlation
between various economic and cul-
tural indices in the home increased
from 18 months to, but not beyond,
various early childhood ages.

This study was not, of course, ap-
proached with the expectation of shed-
ding light on the fast-extinguishing
nature-nurture controversy. The hope
was, however, to establish some prac-
tical rules-of-thumb to aid the diag-
nostician who is called upon to make
an estimate of a child’s future mental
status. While no precise recipe has
been derived, these data seem to in-
dicate that the psychologist who tests
a preschool child would be wise to
shade his estimate of future status
somewhat in the direction of the gen-
eral family level of intellectual ability.
On the other hand, this would not
hold for the junior high school coun-
selor. So far, we have little which
will improve his estimate beyond the
IQ obtained under favorable testing
conditions. The counselor’s chances of
being reasonably accurate even with-
out such further aids, however, are
considerable; as noted earlier, the
correlation between the results of the
administrations of Form L in 1941
and 1956 is .85 for our group.


SUMMARY


In 1956, Form L of the Stanford-
Binet, and the WAIS, were adminis-
tered to 110 young adults who had
previously been given the Stanford-
Binet at preschool and adolescent
ages. In 1941, those subjects whose
IQs had increased or decreased sig-
nificantly since 1931 were selected for
special attention.

Present findings:

For each group, regression to-
ward the mean occurred between 1941
and 1956, but the change in this
period was small compared with the
previous 10 years.

Subjects whose IQs had increased
between preschool and adolescence
showed more reversal in their posi-
tion in the distribution of the total
group than did those whose IQs had
decreased.

Significant changes in position in
the distribution of IQs at preschool
and the distribution of IQs of the same
group at adolescence are related to
ancestral intelligence; this does not
hold true between adolescence and
adulthood.


REFERENCES
Bayley, Nancy. Factors influencing the
growth of intelligence in young children.
Yearb. Nat. Soc. Stud. Educ., 1940, 39,
Part II, 49-79.

Bayley, Nancy. On the growth of intelli-
gence. Amer. Psychologist, 1955, 10,
805-818.

Bayley, Nancy, & Oden, Melita H. The
maintenance of intellectual ability in
gifted adults. J. Gerontol., 1955, 10, 91-
107.

Bradway, Katherine P. An experimental
study of factors associated with Stan-
ford-Binet IQ changes from the pre-
school to the junior high school. J.
genet. Psychol., 1945, 66, 107-128.

Bradway, Katherine P., Thompson, Clare
W., & Cravens, R. B. Preschool IQs
after twenty-five years. J. educ. Psy-
chol., 1958, 49, 278-281.

Brussell, E. S. A revision of the Barr-
Taussig Scale of Occupational Intelli-
gence. Unpublished master’s thesis, Uni-
versity of Minnesota, 1932.

Garrett, H. E. Statistics in psychology and
education. New York: Longmans,
Green, 1932.

Honzik, Marjorie P. Age changes in the re-
lationship between certain environmen-
tal variables and children’s intelligence.
Yearb. Nat. Soc. Stud. Educ., 1940, 39,
Part II, 185-205.

McNemar, Q. The revision of the Stanford-
Binet scale. Boston: Houghton Mifflin,
1942.

McNemar, Q. Psychological statistics. (2nd
ed.) New York: Wiley, 1955.

Miner, J. B. Intelligence in the United
States. New York: Springer, 1957.

Owens, W. A. Age and mental abilities: A
longitudinal study. Genet. psychol.
Monogr., 1953, 48, 3-54.

Terman, L. M., & Merrill, Maud A. Meas-
uring intelligence. Boston: Houghton
Mifflin, 1937.

United States Bureau of the Census.
United States census of population:
California. 1950, P-B 5.


(Received July 14, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 2, 80-81 


A NOTE ON SCREENING SCHOOL BEGINNERS WITH 
THE BENDER GESTALT TEST 


ELIZABETH M. KOPPITZ 
Stanwood, Mt. Kisco, New York 


VERDENA MARDIS AND THOMAS STEPHENS¹
Montgomery County Public Schools, Dayton, Ohio 


There is a growing interest in the 
screening of school beginners both as 
a means of determining the readiness 
for learning and as a way to spot chil- 
dren with potential difficulties early 
so that they can be given the required 
attention and help when it will be 
most beneficial. The Lee-Clark Read- 
ing Readiness Test (Lee & Clark, 
1951) and the Metropolitan Readiness 
Test (Hildreth & Griffith, 1946) are 
two well standardized and widely used 
screening tests. The present study ex- 
plores the usefulness of the Bender 
Gestalt test (Bender, 1938) as a 
screening tool for beginning first grade 
students. 


METHOD


Subjects. The subjects were 272 beginning 
first grade students from 11 classes in seven 
different schools; none were repeating the
first grade. The schools selected represent a 
socioeconomic cross section and are located 
in rural, semirural, suburban, and urban 
areas. 

Procedure. During the first 6 weeks of the 
school year eight of the teachers adminis- 
tered the Lee-Clark Reading Readiness Test 
to their classes, while three other teachers 
gave the Metropolitan Readiness Test to 
their groups. During this same period each 
subject was seen individually by a psychol- 
ogist who administered the Bender Gestalt 
test. At the end of the school year the Met- 
ropolitan Achievement Test, Primary I Bat- 
tery, Form R, was administered to all 11 


¹ The authors wish to express their ap-
preciation for the assistance provided by
Sam Bonham and the staff of the Pupil Per- 
sonnel Department of Montgomery County, 
as well as by the teachers and principals of 
the schools included in this study. 


classes by their respective teachers. The
Readiness tests and the Achievement Test
were scored by the teachers following stand-
ard procedure for each test. The Bender
protocols were scored by the psychologists
according to a system developed by the
senior investigator for use with school chil-
dren (Koppitz, 1958, 1960).
Actual achievement as measured by the
Total Average Achievement score on the
Metropolitan Achievement Test was com-
pared for each subject with his predicted
achievement on each of the screening tests.
Predictive scores were derived from the
grade equivalent scores on the Readiness
tests and from standard deviation scores on
the Bender. Pearson product-moment cor-
relations were computed between the Read-
iness tests and the Bender, and between the
three screening tests and the Metropolitan
Achievement Test.
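
For readers unfamiliar with the statistic, a Pearson product-moment correlation can be computed as in the sketch below. The data are hypothetical and are not taken from the study; they are chosen only to show why a test scored for errors, as the Bender is, correlates negatively with achievement.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sum_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_xx = sum((a - mean_x) ** 2 for a in x)
    sum_yy = sum((b - mean_y) ** 2 for b in y)
    return sum_xy / sqrt(sum_xx * sum_yy)


# Hypothetical error scores and end-of-year grade equivalents.
bender_errors = [2, 5, 8, 11, 14]
achievement = [2.4, 2.1, 1.9, 1.6, 1.2]

r = pearson_r(bender_errors, achievement)
print(round(r, 2))
```

In this invented example more errors go with lower achievement, so the coefficient is negative even though the relationship is strong.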


RESULTS AND DISCUSSION


Table 1 shows the correlations be-
tween the various tests, all of
similar magnitude and all statistically
significant. Thus it appears that the
Bender can predict Total Average
Achievement as measured on the Met-
ropolitan Achievement Test as well as
the Lee-Clark and the Metropolitan
Readiness Test. It is, of course, real-
ized that the Lee-Clark Reading Test
is specifically designed to test reading
and is not meant to predict total av-
erage achievement. However, in dis-
cussing the validity of the Reading
Readiness test, Lee and Clark quote
Henig (1949) who compared the
Reading Readiness test scores with
actual reading grades at the end of the
school year and obtained a correlation




TABLE 1

CORRELATIONS BETWEEN THE BENDER,
READINESS TESTS, AND ACHIEVEMENT
TEST
School Lee-Clark & q k 
choot | av |tgesClank &| Render’ & | Ee oer 
A | 63) .67** —.67** | —.64** 
B |56| .42** | —.37** | —.30* 
C |40| .54** | —.41** | —.21 
: 26} .40* —.5g** | —.33 
24) .67** —.61** | —.54** 
Total 199 .66** —.6g** | —.61** 
Met. Read. &| ke 
ce Read) Bee ies Rees 
F | 31 
.63%* =| —.71** | —.73** 
. 42| .66** —.29 age 
Total | 73 -59** _58e* _.59** 


ᵃ Correlations with the Bender are negative since
the Bender is scored for errors.
* Significant at .05 level.
** Significant at .01 level.


of [illegible] which is similar to the results
obtained in this study.
A comparison between the vari-
ous predictions of achievement and the
actual achievement of each subject re-
vealed some differences between the
three screening tests. It appears that
the Lee-Clark test tends to overesti-
mate achievement more often than the
Bender and the Metropolitan Readi-
ness Test. The latter two tests tend to
underestimate achievement more of-
ten. Further investigation on those
subjects who had marked discrepan-
cies between their actual and pre-
dicted achievement suggests that the
Lee-Clark test is more strongly influ-
enced by cultural and social factors,


while the Bender reveals apparently 
more the potential ability in visual- 
motor perception which may or may 
not be fully developed and put to use. 
Further exploration of these tentative 
findings seems indicated. 


SUMMARY 


The Bender Gestalt test and the 
Lee-Clark Reading Readiness Test or 
the Metropolitan Readiness Test, re- 
spectively, were administered to 272 
beginning first grade students. Test 
scores were correlated with each other 
and with actual achievement at the end 
of the school year. It was found that 
the Bender correlates well with the 
Readiness tests and can predict actual 
achievement as well as they can. 


REFERENCES 


Bender, Lauretta. A Visual Motor Gestalt
Test and its clinical use. New York:
American Orthopsychiatric Association,
1938.

Henig, M. S. Predictive value of a reading
readiness test and teachers’ forecast.
Elem. sch. J., 1949, 50, 41-46.

Hildreth, Gertrude. Metropolitan Achieve-
ment Test, Primary I Battery, Form R.
Yonkers-on-Hudson: World Book, 1946.

Hildreth, Gertrude, & Griffith, Nellie L.
Metropolitan Readiness Test. Yonkers-
on-Hudson: World Book, 1946.

Koppitz, Elizabeth M. The Bender Gestalt
test and learning disturbances in young
children. J. clin. Psychol., 1958, 14, 292-
295.

Koppitz, Elizabeth M. The Bender Gestalt
test for children: A normative study. J.
clin. Psychol., 1960, 16, 432-435.

Lee, J. M., & Clark, W. W. Lee-Clark Read-
ing Readiness Test. Los Angeles: Cali-
fornia Test Bureau, 1951.


(Received July 14, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 2, 82-90 


SOME RELATIONSHIPS BETWEEN PUPIL BEHAVIOR 
AND CERTAIN TEACHER CHARACTERISTICS 
DAVID G. RYANS 


University of Texas 


It usually is assumed that what goes 
on among pupils in a classroom is de- 
pendent to a major degree upon the 
presence of a teacher—a teacher whose 
role is to provide leadership in “learn- 
ing” activities at which the pupils 
themselves are initiates, unprepared 
without assistance to determine ap- 
propriate goals or to select and employ 
suitable ways and means of attaining 
such goals. The teacher, by virtue of 
education and experience, is expected 
to be able to guide and assist his pupils 
in the acquisition of usable knowledge, 
understandings, skills, and group ap- 
proved attitudes and value systems. It 
is unthinkable, except in very rare 
cases, that individual children or 
groups of children entirely on their 
own might successfully pursue the 
kinds of objectives that permit ad- 
justment to the complex social and 
material worlds in which we live. 

Qualitatively, at least, the logic of 
this argument appears sound: i.e., pu- 
pil behavior is a function of teacher 
behavior, the teacher being a neces- 
sary, though not sufficient, condition 
for purposeful and productive pupil 
performance. Beyond this point we 
usually are less willing to express our- 
selves regarding the nature of the 
pupil behavior-teacher behavior rela- 
tionship. We like to think specified, 
distinguishable teacher acts or teacher 
behaviors produce, or at least con- 
tribute to, certain distinguishable pu- 


¹ This research was made possible by a
subvention from the Grant Foundation and
was sponsored by the American Council on
Education. It was reported at the 1960 meet-
ings of the American Educational Research
Association.


pil acts or behaviors (Mitzel, 1960;
Ryans, 1957, 1960). Some of us like
to postulate quantitative or functional
relationships between kinds of teacher
behavior and pupil behavior—assum-
ing that more of a certain kind of
teacher behavior will result in more
effective pupil behavior—and attempt
to gather data and evaluate the evi-
dence (Anderson & [illegible];
Christensen, 1960; [illegible];
Gnagney, 1960; Jayne, 1945; [illegible],
1952; and others). However, such
functional relationships are not easy
to demonstrate empirically, partly be-
cause of difficulties of observation and
mensuration and, undoubtedly, partly
because of the complex interaction
among teacher and pupil character-
istics (Ryans, 1956). Certainly re-
search that might permit inferences
regarding dependency relationships
(producer-product relationships), with
classroom situations providing the set-
ting, is difficult to arrange and carry
through—largely due to limitations
imposed by practicality.

A correlational approach (less satis-
factory than a dependency relationship
strategy for yielding the definitive
kinds of answers a researcher often
seeks, but nevertheless providing im-
portant information about the exist-
ence and degree of functional relation-
ships) often poses much less of a data-
obtaining problem and, at the same
time, provides an alternative avenue
to clues and cues—in this case, re-
garding the influence of teacher be-
havior on pupil behavior.

It was this second sort of relation-
ship (the interdependency or correla-
tional relationship) between “what
teachers do” and “what pupils of the
teachers do” with which the presently
reported investigation was concerned:
the extent to which certain in-
dices of pupil behavior tend to be
functionally related to, or to manifest
variance in common with, certain in-
dices of teacher behavior.


METHOD


First of all, it is necessary to point
out that the approach to pupil be-
havior analysis in this research was
a direct one involving the immediate
observation and assessment of the be-
havior of pupils. The pupils were ob-
served in their regular classrooms in
the presence of their teachers—teach-
ers who also were observed and as-
sessed.

Admittedly, this directly observed
pupil behavior in process may be quite
different from the products resulting
from such behavior (Mitzel, 1960;
Mitzel & Gross, 1956; Ryans, 1957,
1960). Researchers in the area some-
times contend it is more important
and relevant to judge pupil behavior
in light of its products—the measur-
able skills, understandings, and atti-
tudes attained by pupils—than merely
in light of ongoing pupil be-
havior. After all, they point out, it is
the end product we are teaching for.

The method employed in the
present research does involve an ap-
parently reasonable, though certainly
unproved, assumption: namely, that
pupils who carry on their classroom
activities in certain ways will acquire
intended learnings more successfully
than pupils who behave or act in other
ways. But the fact remains that edu-
cation is interested ultimately in pupil
behavior and in teacher behavior for
the results produced; immediate be-
havior is of interest only insofar as it
is a means to an end. So, one might
contend that the pupil behavior with


which this paper deals is of only in- 
cidental concern. The author does not 
agree, but it is a point of methodology 
that should be made clear at the out- 
set. 

Next, it is appropriate to explain 
what kinds of directly observable pu- 
pil behavior the research dealt with. 

Obviously many specific behaviors, 
or acts, may be noted in a classroom 
even during a very limited sample of 
time. One thing assumed in this re- 
search (a basic postulate of trait the- 
ory that makes the researcher's task 
tolerable) was that individual behav- 
iors possess some generality and there- 
fore may be classified in a fairly lim- 
ited number of classes or groups, each 
group being made up of related specific 
behaviors that have a common sub- 
stratum which identifies the group or 
class. Thus, a number of pupil be- 
haviors such as “rudeness to teacher 
and/or other pupils,” “interruption of 
one another,” “impatience,” “refusal 
to participate,” “quarrelsomeness,” 
“sullenness,” “disturbing noisiness,”
etc. might all be considered as belong-
ing to a class of behaviors we could,
for convenience, label obstructiveness.
Again, pupil behaviors such as “cour-
teousness,” “friendliness,” “attentive-
ness to advice or criticism,” “diligence
in completing study assignments,” 
“initiation of constructive activity in 
the absence of specific direction from 
the teacher,” etc. could be thought of 
as belonging to a class of behaviors 
we might call responsibility. 


In this research we selected (after con-
siderable study of previous classroom and
“personality” research and after a good deal
of trial and error involving classroom ob-
servation and assessment) and focused our
attention on a limited number of “impor-
tant” pupil behavior dimensions. The term
“dimension” was used with intent, since in
planning the research it seemed reasonable
to attempt to identify and assess acts of
pupils in the classroom, first, in light of


their qualitative characteristics (the general 
class of behaviors to which a specific act be- 
longed) and then, according to their posi- 
tions on quantitative scales defined by sub- 
classes of behaviors falling at opposite poles 
of the assumed continua. Thus, one bipolar 
dimension of pupil behavior selected be- 
cause it was judged to be an important one 
in the classroom was the Apathetic-Alert 
dimension. Specific pupil behaviors that 
seemed to contribute to alertness (e.g., “re- 
sponded eagerly,” “appeared anxious to re- 
cite and participate,” “watched teacher at-
tentively,” etc.) defined one end of the
continuum and pupil acts that appeared to 
contribute to apathy (“listless,” “restless,” 
“wandering attention,” “indifference,” etc.) 
defined the other. 

In attempting to assess pupil behavior,
trained observers employed several such di-
mensions, assigning each class of pupils ob-
served a value, on a scale extending from 1
to 7, on each dimension. An assessment of 1
(at the left pole) on the Apathetic-Alert
dimension indicated an inference presum-
ably based upon observation of a prepon-
derance of apathetic-type behaviors among
the pupils; a 7 (at the right pole) indicated
an inference based upon the presumed ob-
servation of many alert-type pupil behav-
iors. A “Glossary,” which accompanied the
assessment form and which was employed in
the training of observers, helped to stand-
ardize the assessment procedure.

Estimates of the reliability of assessment
of the pupil behavior dimensions, based on
correlations between independent assess-
ments of two observers visiting the same


TABLE 1


LOADINGS OF OBSERVED DIMENSIONS OF PUPIL BEHAVIOR AND TEACHER BEHAVIOR ON A
PUPIL BEHAVIOR FACTOR: SECONDARY SCHOOL CLASSES
(From Ryans and Wandt, 1952)


Classroom Behavior Dimension          Factor
                                      A  B  C  D  E  F

Pupil Behavior: 

Apathetic-Alert -07 01 

Obstructive-Responsible —"02 “03 “8 21 05 = —.05 

Uncertain-Confident 17 “02 -46 04 —.06 10 

Dependent-Initiating “00 ‘ 43 —.01 ‘17 —.03 
Teacher Behavior:* — 05 -55 04 2100 —.02 

Partial-Fair 

Autocratic-Democratic — .04 

Aloof (G)-Responsive — .02 

Aloof (I)-Responsive 125 

Restricted-Understanding -02 

Harsh-Kindly -08 

Dull-Stimulating — .02 

Stereotyped-Original -32 

Apathetic-Alert At 

Unimpressive-Attractive -12 

Inarticulate-Articulate -O1 

Monotonous (V)-Pleasant —.04 

Evading-Responsible —.01 

Erratic-Steady —.08 

Excitable-Poised -08 

Uncertain-Confident _ -06 

Disorganized-Systematic +20 

Inflexible-Adaptable -02 

Pessimistic-Optimistic -00 

Immature-Integrated -19 

Narrow-Broad is 2 


Note.—N = 249 senior high school mathematics, science, English, and social studies classes.

* Loadings of teacher behavior dimensions are omitted here for factors on which the loadings
of pupil behavior dimensions were not pronounced.


classes at different times ranged from .43 to
.65 for 1,518 elementary school classes ob-
served, and from .43 to .63 for 1,911 second-
ary school classes.

Teacher behavior, observed and assessed
following the same procedure as that de-
scribed for pupil behavior, was considered in
terms of dimensions such as Harsh-Kindly,
Disorganized-Systematic, Dull-Stimulating,
etc. (see Tables 1 and 2). Reliability
estimates of the assessments of the several
dimensions of observed teacher behavior,
based on correlations of the assessments of
the first observer and second observer, gen-
erally were between .50 and .60—for ele-
mentary teachers coefficients for the differ-
ent dimensions clustered around .55, and for
secondary school teachers, around .60.

A minimum of two independent sets of
assessments was obtained for each class of
pupils and for each teacher, the separate
ratings subsequently being weighted equally
(by transformation to a scale employing a
common mean and standard deviation) and
combined to form composite assessments for
each teacher and each class of pupils on each
of the bipolar dimensions considered.
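
The equal weighting of separate ratings described above can be sketched as follows. The target mean of 50 and standard deviation of 10 are assumptions made for illustration (the source says only that a common mean and standard deviation were used), and the function names and ratings are hypothetical.

```python
from statistics import mean, pstdev


def rescale(ratings, target_mean=50.0, target_sd=10.0):
    """Put one observer's ratings on a scale with the given mean and SD."""
    m, s = mean(ratings), pstdev(ratings)
    return [target_mean + target_sd * (x - m) / s for x in ratings]


def composite(observer_a, observer_b):
    """Average two observers' ratings after rescaling each set."""
    scaled_a, scaled_b = rescale(observer_a), rescale(observer_b)
    return [(a + b) / 2 for a, b in zip(scaled_a, scaled_b)]


# Hypothetical 1-to-7 ratings of three classes by two observers.
first_observer = [2, 4, 6]
second_observer = [5, 6, 7]

composites = composite(first_observer, second_observer)
print([round(c, 2) for c in composites])
```

Rescaling first keeps an observer who uses a wider or higher range of the 7-point scale from dominating the composite.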

The teachers and classes of pupils upon 
which data of this report are based, gener- 
ally, were from the school systems of fairly 
large cities. Geographically, the midwest and 
west accounted for a majority of the school 
systems sampled (93%). The classes repre- 
sented 274 elementary schools and 103 sec- 
ondary schools. Because of dependence upon 
the voluntary cooperation both of admin- 
istrative offices responsible for school sys- 
tems and of individual teachers, the sample 
was not selected so as to constitute a known 


probability sample. 


TABLE 2


LOADINGS OF OBSERVED DIMENSIONS OF PUPIL BEHAVIOR AND TEACHER BEHAVIOR ON
TWO FACTORS INVOLVING, IN PART, “PUPIL PARTICIPATION” AND “CONTROLLED
PUPIL ACTIVITY”: ELEMENTARY SCHOOL CLASSES
(From Ryans, 1952)


Oblique Factor 


Classroom ‘i . z 
Behavior Dimension ri = a a = 
Pupil Behavior: 
Oisinterested-Alert 30 50 Be 08 00 
structive-Constructive 16 63 —.07 .07 o4 
pestrained-Participating 40 10 o4 .00 18 
Rude-Self-Controlled —.01 66 10 160 —.05 
pbathetic-Initiating 1) 29 ‘050-08 ‘02 
Te ependent-Responsible 46 AD 19 —.03 —.10 
acher Behavior:* 
Partial-Fair el .06 
Autocratic-Demoeratic 42 —.1l 
R of (G)-Responsive .00 —.09 
Fen ocisd-Understending 27 — .06 
hattractive-Attractive .00 .02 
I isorganized-Systematic —.01 56 
harticulate-Fluent -00 .33 
Hpiflexible-Adaptable .36 02 
arsh-Kindly 125 “08 
Apathetic-Alert 05 08 
eu (I)-Responsive —.02 —.10 
tereotyped-Original ° 52 14 
hangeable-Constant — .04 54 
xcitable-Calm —.00 -36 
Uncertain-Confident —.02 Ad 
rresponsible-Responsible -04 .60 
‘essimistic-Optimistic -05 04 
Infantile-Mature +21 44 


Note.—N = 275 third and fourth grade elementary school classes.

* Loadings of teacher behavior dimensions are omitted here for factors on which the loadings
of pupil behavior dimensions were not pronounced.




RESULTS


Estimates of pupil behavior and of 
teacher behavior, obtained by the di- 
rect observation procedures described 
above, were subjected to correlational 
analyses. It should be noted that the 
findings, therefore, have to do simply 
with concomitance and permit infer-
ences of interdependency relationships
only. Inferences regarding “anteced-
ent-consequent” or “producer-prod-
uct” relationships may be suggested
by such data, but the design employed
precludes the testing of dependency
relationship hypotheses.


Relationships among the Pupil and 
Teacher Behavior Dimensions 


One set of findings to be reported 
has to do with the extent to which the 
observed pupil behavior dimensions 
were related to the concurrently ob- 
served teacher behavior dimensions, 

Here, in fairness to inferences that 
may be drawn, it should be noted 
that the pupil behaviors of the class of 
a given teacher were observed and as-
sessed during the same session, or pe-
riod, during which the teacher of that
class was observed and assessed. Ob-
viously there well may be contamina-
tion of one set of assessments by the
other. (It would have been possible to
at least partially correct for this, but
it also would have added materially 
to both dollar and time expenditures; 
consequently, the choice was to recog- 
nize this source of error variance, but 
not to attempt to control it.) 

In reviewing the pupil behavior- 
teacher behavior relationship findings 
the results for elementary teachers and 
secondary teachers will be presented 
separately; inferences that may be
drawn are somewhat different. The in-
teraction of grade level with relation-
ships of this sort appears to be pro-
nounced.

Relationships in the secondary school. In the secondary school (Ryans
& Wandt, 1952), all of the pupil behavior dimensions considered in
this research tended to be interrelated, the intercorrelations ranging
up to .68. They tended to hang together as a single cluster, all
contributing to the same oblique factor when the intercorrelations of
pupil behavior and teacher dimensions were factor analyzed. Table 1
shows relevant factor loadings. It may be noted from Table 1 that the
factor defined by the distinctly homogeneous cluster of pupil behavior
dimensions was not significantly contributed to by the teacher
behavior dimensions assessed, with one exception (the Dull-Stimulating
teacher behavior dimension). Of all the teacher behavior dimensions,
only that which had to do with the extent to which a teacher was
judged Dull-Stimulating seemed to be closely associated with the pupil
behavior dimension cluster.

In the secondary school, pupil behavior (as here defined and assessed)
seemed not to be as closely related to teacher behavior as we
sometimes assume it to be.


Relationships in the elementary school. In the elementary school
(Ryans, 1952) we note a distinctly different situation. Table 2
presents results comparable to those shown for the secondary school in
the preceding table. In the elementary school data, however, the tight
clustering of pupil behavior dimensions, relatively independent of
teacher behavior dimensions (as in the high school), is absent. The
pupil behavior dimensions investigated, with intercorrelations ranging
from .21 to .65, split up and fell into one or the other of two
loosely correlated factors, or groups, of combined pupil behavior and
teacher behavior dimensions. In the elementary school the loadings of
the pupil behavior dimensions fell on two somewhat different factors
rather than on a single factor, as was the case when secondary school
classrooms were considered.
The pupil behavior dimensions contributing to the first factor
appeared to be those having to do with pupil participation. And
related to this set of pupil dimensions were teacher behavior
dimensions that had been labeled Autocratic-Democratic and
Inflexible-Adaptable, not entirely unlike the Dull-Stimulating teacher
behavior dimension in the secondary school analysis. These kinds of
relationships make common sense. Other pupil behavior dimensions,
which together might be referred to as controlled pupil activity, were
included in a second factor, which also included such teacher behavior
dimensions as Irresponsible (evading)-Responsible,
Disorganized-Systematic, Infantile-Mature, and Uncertain-Confident,
characteristics that suggest disorganized, irresponsible vs.
businesslike, systematic teacher behavior.
In the elementary school, then, pupil behavior and teacher behavior
seemed to be more noticeably interdependent than in the secondary
school, with participating pupil behavior seeming to be related to
flexible, original, democratic teacher behavior and controlled pupil
behavior to systematic, responsible teacher behavior.

It may be noted parenthetically that the analyses undertaken in
connection with this research yielded evidence suggestive of a
moderate positive relationship between the factors contributed to by
the pupil behavior dimensions and those teacher behavior dimensions
that loaded on a factor which seems to refer to “the way the


teacher appears” to his or her class 
(teacher behavior dimensions such as 
Monotonous-Pleasant (voice), Unim- 
pressive-Attractive, Inarticulate-Ar- 
ticulate, and Uncertain-Confident con- 
tributed to this teacher appearance 
pattern). This sort of relationship be- 
tween pupil behavior and teacher ap- 
pearance seemed to hold to some 
degree among both elementary and 
secondary school classes. In the sec- 
ondary school, the median correlation 
between the teacher behavior dimen- 
sions that contributed to teacher ap- 
pearance and the pupil behavior di- 
mensions was .27; and the correlation 
between the teacher appearance factor 
and the factor dominated by pupil be- 
havior (Factor C of Table 1) was 83, 
In the elementary school, the median 
correlation between assessments on the 
teacher behavior dimensions that re- 
lated to teacher appearance and as- 
sessments of the pupil behavior di- 
mensions was .37; and the correlations 
between the teacher appearance factor 
and the factors contributed to by the 
pupil behavior dimensions (Factors A
and B of Table 2) were .57 and .23,
respectively.

Relationships Based on an Overall In- 
dex of Pupil Behavior 

Now, to turn to another related aspect of the analyses which supports
the foregoing findings, although coming at the problem in a slightly
different manner:

In an extension of the studies just discussed, but with new
independent samples of teachers and classes, an index (based on the
combination of assessments for teacher behavior dimensions that loaded
most highly on factors generated by the earlier factor analyses;
Ryans, 1952, and Ryans & Wandt, 1952)² was obtained for each teacher
on each of three prominent patterns of teacher behavior, and the
teacher behavior pattern scores then considered in relation to pupil
behavior indexes (a composite score for each class, Po, obtained from
the assessments assigned the pupil behavior dimensions)³ of the
teachers’ classes. The teacher behavior patterns considered were ones
suggested by the earlier factor analyses: (a) aloof, restricted,
egocentric vs. kindly, understanding, warm teacher behavior; (b)
evading, unplanned, slipshod vs. responsible, systematic, businesslike
teacher behavior; and (c) dull, routine vs. stimulating, imaginative
teacher behavior.


TABLE 3

CORRELATIONS BETWEEN THE PUPIL BEHAVIOR INDEX AND TEACHER BEHAVIOR PATTERNS

Sample                                         r(Po,Xo)  r(Po,Yo)  r(Po,Zo)

Elementary school classes:
  834 Grades 1-6 classes                          .82       .80       .75
  144 Grades 1-6 classes                          .83       .78       .80
Secondary school classes:
  497 Mathematics and science classes             .20       .18       .21
  568 English and social studies classes          .18       .21       .26
  114 Mathematics, science, English, and
      social studies classes                      .07       .11       .14

Note. Po = pupil behavior (composite).
Xo = kindly, understanding vs. aloof, restricted teacher behavior.
Yo = responsible, businesslike vs. evading, unplanned teacher behavior.
Zo = stimulating, imaginative vs. dull, routine teacher behavior.


² Assessments of each observer on each teacher behavior dimension were
transformed into standard scores. Then for each pattern of teacher
behavior (Xo, Yo, Zo) the standard scores of the several components
were summed to provide the three pattern scores, based on the
observer’s assessments of a particular teacher. These pattern scores
were, in turn, transformed into standard scores for each observer; and
standard scores for independent observers were combined to yield a
composite assessment on each of the behavior patterns for each teacher.

³ The pupil behavior index was obtained simply by (a) summing for each
class, each observer’s assessments (previously transformed to standard
scores) on the several pupil behavior dimensions and (b) computing the
mean of the equally weighted indexes contributed by the several
observers who assessed a particular class of pupils.

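The scoring steps described in the footnotes on pattern scores and the pupil behavior index can be sketched as follows. This is only an illustration under stated assumptions: the ratings are invented, the function names are hypothetical, the population standard deviation is used for standardizing, and the final "combining" of observers is taken to be a simple mean, none of which the paper specifies.

```python
from statistics import mean, pstdev


def standardize(values):
    """Convert raw values to standard (z) scores (population SD assumed)."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def pattern_scores(ratings):
    """ratings[t][d] is one observer's rating of teacher t on dimension d.

    Each dimension is standardized across teachers, the component standard
    scores are summed per teacher, and the sums are standardized again so
    that every observer's pattern scores are on a common scale.
    """
    n_dims = len(ratings[0])
    columns = [standardize([row[d] for row in ratings]) for d in range(n_dims)]
    sums = [sum(col[t] for col in columns) for t in range(len(ratings))]
    return standardize(sums)


def composite(per_observer_scores):
    """Combine independent observers by averaging (an assumption)."""
    return [mean(scores) for scores in zip(*per_observer_scores)]


# Hypothetical 1-7 ratings: 4 teachers x 3 component dimensions, 2 observers.
observer_a = [[2, 3, 2], [5, 6, 5], [4, 4, 3], [6, 7, 6]]
observer_b = [[3, 2, 2], [5, 5, 6], [3, 4, 4], [7, 6, 6]]

composite_scores = composite(
    [pattern_scores(observer_a), pattern_scores(observer_b)]
)
```

Standardizing within each observer before combining keeps one observer's habitually high or low ratings from dominating the composite, which appears to be the purpose of the two-stage transformation the footnote describes.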
Reliabilities of the indexes (scores) for the patterns of teacher
behavior just named (based on correlations between the indexes yielded
by assessments of different observers who independently assessed the
same teachers)
ranged from .70 to 0, Reliability 
estimates of the pupil behavior index 
Were slightly Jo 
ple of 9g 
havior 


imaginative 


wae Cients of .62 84, 72, 


than 30 m: minutes apart, more 
= Part but during the 
5 8Y, during different halves 
b ay (fore 
ut the same 


°Y, and at least one 
a 

Part but during the same 

between pupil behavior 

behavior based 

a’ lik eetneadtity 

ike number 0 

ol Classeg are presented 


Secondary school relationships. When the teacher behavior pattern
scores were correlated with the pupil behavior index for secondary
school classes of mathematics, science, English, and social studies,
the evidence suggested low positive relationships between pupil
behavior and teacher behavior, the obtained correlation coefficients
between the pupil behavior index and the three teacher behavior
pattern scores being between .07 and .26. It probably may be inferred
that the relationship is not pronounced in secondary school classes.
Elementary school relationships. With elementary school classes, the
pupil behavior index was found to be highly correlated with all the
teacher behavior pattern scores (as suggested by the analyses reported
in the preceding sections of the present paper), the correlation
coefficients ranging from .75 to .83. (It may be noted that these
pupil behavior-teacher behavior correlations in the elementary school
were as high as the reliability coefficients for the several pattern
scores.)

Thus, for the kinds of teacher behavior and pupil behavior described,
the two separate analyses support the same conclusion: i.e., that
teacher behavior and pupil behavior show substantially more
interdependence in the elementary school as compared with the
secondary. There also is the suggestion that of the teacher behavior
dimensions and patterns studied, Dull-Stimulating teacher behavior may
be more closely associated with secondary school pupil behavior.

Such findings, it should be recalled, are based upon correlational
analysis, permitting interdependency, but not dependency
(producer-product), inferences about the relationships involved.
Furthermore, the relationships reported refer to group data (to groups
of teachers and their classes), and the probability risks involved in
applying conclusions to specific situations should be kept in mind.


SUMMARY 


Relationships between trained ob- 
servers’ assessments of (a) classes 
of pupils and (b) the teachers of 
those classes, relative to selected di- 
mensions of pupil and teacher class- 
room behavior, were studied. 

For elementary school classes, high 
positive relationships were noted be- 
tween observers’ assessments of “pro- 
ductive pupil behavior” (e.g., assess- 
ments presumed to reflect pupil 
alertness, participation, confidence, 
responsibility and self-control, initiat- 
ing behavior, etc.) and observers’ as- 
sessments of previously identified 
patterns of teacher behavior which 
seemed to refer to understanding, 
friendly classroom behavior; organ- 
ized, businesslike classroom behavior; 
and stimulating, original classroom 
behavior. 

For secondary school classes, low 
positive relationships appeared to ob- 
tain between productive pupil behav- 
ior and the above named categories 
of teacher behavior, with a tendency 
for the stimulating, original teacher 
classroom behavior pattern to show 
a slightly higher correlation with pupil 
behavior than the understanding, 
friendly or the organized, businesslike 
teacher behavior patterns. 

The approach was correlational, permitting inferences of
interdependency, but not producer-product, relationships.

REFERENCES

Anderson, H. H., & Brewer, J. E. Studies of teachers’ classroom
personalities: II. Effects of teachers’ dominative and integrative
contacts on children’s classroom behavior. Appl. psychol. Monogr.,
1946, No. 8.

Christensen, C. M. Relationships between pupil achievement,
affect-need, teacher warmth, and teacher permissiveness. J. educ.
Psychol., 1960, 51, 169-174.

Cogan, M. L. Behaviors of teachers and the productive behavior of
their pupils. J. exp. Educ., 1958, 27, 89-124.

Gnagey, W. J. Effects on classmates of a deviant student’s power and
response to a teacher-exerted control technique. J. educ. Psychol.,
1960, 51, 1-8.

Jayne, C. D. A study of the relationship between teaching procedures
and educational outcomes. J. exp. Educ., 1945, 14, 101-134.

Mitzel, H. E. Criteria of teacher effectiveness. In, Encyclopedia of
educational research. New York: Macmillan, 1960. Pp. 1481-1486.

Mitzel, H. E., & Gross, Cecily F. A critical review of the development
of pupil growth criteria in studies of teacher effectiveness.
(Research Series 31) New York: Board of Higher Education, Office of
Research and Evaluation, 1956.

Ryans, D. G. A study of criterion data: A factor analysis of teacher
behaviors in the elementary school. Educ. psychol. Measmt., 1952, 12,
333-344.

Ryans, D. G. Theory development and the study of teacher behavior. J.
educ. Psychol., 1956, 47, 462-475.

Ryans, D. G. Notes on the criterion problem in research, with special
reference to the study of teacher characteristics. J. genet. Psychol.,
1957, 91, 33-61.

Ryans, D. G. Teacher characteristics in relation to marital status of
the teacher. Calif. J. educ. Res., 1959, 10.

Ryans, D. G. Prediction of teacher effectiveness. In, Encyclopedia of
educational research. New York: Macmillan, 1960. Pp. 1486-1491.

Ryans, D. G., & Wandt, E. A factor analysis of observed teacher
behaviors in the secondary school: A study of criterion data. Educ.
psychol. Measmt., 1952, 12, 574-586.

Withall, J. Assessment of the social-emotional climates experienced by
a group of seventh graders as they moved from class to class. Educ.
psychol. Measmt., 1952, 12, 440-451.


(Received July 28, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 2, 91-97


INVENTORY ESTIMATED TEACHER CHARACTERISTICS 
AS COVARIANTS OF OBSERVER ASSESSED 
PUPIL BEHAVIOR¹


DAVID G. RYANS 


University of Texas 


¹ This research was made possible by a subvention from the Grant
Foundation and was sponsored by the American Council on Education. It
was reported at the 1960 meetings of the American Psychological
Association.

In a related report, Ryans (1961) presented observation data
indicative of: (a) among elementary school classes, high positive
relationships between observers’ assessments of “productive pupil
behavior” (e.g., assessments presumed to reflect pupil alertness,
participation, confidence, responsibility and self-control, initiating
behavior, etc.) and observers’ assessments of previously identified
patterns of teacher behavior which seemed to refer to understanding,
friendly classroom behavior; organized, businesslike classroom
behavior; and stimulating, original classroom behavior; and (b) among
secondary school classes, low positive relationships between
productive pupil behavior and the above named categories of teacher
behavior, with a tendency for the stimulating, original teacher
classroom behavior pattern to show a slightly higher correlation with
pupil behavior than the understanding, friendly or organized,
businesslike teacher behavior patterns.

If, as the evidence suggests, interdependency relationships between
overt teacher behavior and overt pupil behavior in the classroom are
demonstrable, a question which next is suggested is whether or not
similar relationships exist between personal characteristics of the
teacher as reflected by self-report inventory scores and overt pupil
behavior in teachers’ classes: whether or not inventory estimated
teacher traits correlate with, and therefore may be used to predict,
pupil classroom behavior. It was to this problem the present research
was addressed.


Two hypotheses derived from earlier 
studies (Ryans, 1961) were con- 
sidered: 


1. Certain teaching oriented, trait 
(or “characteristic”) scores yielded by 
teachers’ responses to a self-report in- 
ventory, the Teacher Characteristics 
Schedule, covary in a nonrandom man- 
ner with indexes of observer assessed, 
overt pupil behavior in the teachers’ 
classes, permitting better-than-chance 
prediction of one from the other. 

2. Teacher characteristic-pupil be- 
havior relationships are more notable 
in elementary than in secondary school classes.


METHOD


Pupil Behavior Data 

The classroom activities of pupils 
were directly observed by trained and 
experienced observers. The direct ob- 
servation approach involved a reason- 
able, though certainly unproved, as- 
sumption: namely, that pupils who 
carry on their classroom activities in 
certain ways will acquire the intended
“learning” (i.e., knowledge, understandings,
attitudes) more successfully than pupils
who behave in less seemingly productive manners.

It is important to note the particular kinds of pupil behavior that
were observed and assessed, thus providing an operational definition
of pupil classroom behavior as considered in
this research. Obviously, the number 
of specific pupil behaviors occurring 
in a classroom is almost unlimited. It 
was necessary, therefore, to make the 
usual assumption that such specific be- 
haviors possess some generality and 
that they may be classified in a fairly 
limited number of groups or classes, 
each group being made up of related 
behaviors possessing a common sub- 
stratum which identifies the behavior 
group. Thus, specific pupil acts involving “rudeness to teacher and/or
other pupils,” “interruption of one another,” “refusal to
participate,” “quarrelsomeness,” “disturbing noisiness,” and the like
might reasonably be thought of as belonging to a class of behaviors
labeled, for convenience, obstructiveness. Or, specific pupil
behaviors involving “courteousness,” “friendliness,” “attentiveness to
advice or criticism,” “diligence in completing study assignments,”
“initiation of constructive activity on one’s own,” and the like,
might be grouped together in a class of behaviors we might call
responsibility.


The groups or categories of pupil behavior selected for consideration
were determined in light of review and analysis of previous classroom
and personality research and after extensive pilot classroom
observation and assessment during the preliminary phases of the study.

For assessment purposes, the major pupil behavior categories to be
considered were cast in the form of dimensions, thus permitting the
identification and assessment of observed pupil behavior (a) in the
light of its essential qualitative characteristic (the general class
of behaviors to which a specific act belongs) and (b) according to its
position on a quantitative scale defined by subclasses of behavior
falling at opposite poles of the assumed continua. Thus, one bipolar
dimension of pupil behavior that observers were trained to assess was
an Apathetic-Alert dimension. Pupil acts or behaviors that seemed to
contribute to alertness (e.g., “responded eagerly,” “appeared anxious
to recite and participate,” etc.) defined one pole of the dimension
and behaviors contributing to apathy (e.g., “listlessness,”
“restlessness,” “wandering attention,” etc.) defined the other.
Four such dimensions of pupil behavior were observed and assessed:
Apathetic-Alert, Obstructive-Responsible, Uncertain-Confident, and
Dependent-Initiating.²

In assessing pupil behavior, trained observers assigned each class of
pupils observed a value, on a scale extending from 1 to 7, on each
dimension. An assessment of 1 (at the left pole of the scale) on the
Apathetic-Alert dimension indicated presumed observation of a
preponderance of apathetic behaviors; an assessment of 7 (at the right
pole) indicated presumed observation of many alert-type behaviors.
Assessments were made for intact classes, that is, for classes as a
whole. A “Glossary,” which accompanied the assessment form and which
was employed in the training of observers, helped to standardize the
assessment procedure.

Reliability coefficients for assessments of the several pupil behavior
dimensions, based on correlation of independent assessments of first
observer and second observer, ranged from .43 to .65 for some 1,500
elementary school classes and from .43 to .63 for approximately 1,900
secondary school classes. Intercorrelations among the pupil behavior
dimensions were substantial, ranging up to .68 for secondary school
classes and from .21 to .65 for elementary school classes.
(Factorially, the pupil behavior dimensions for secondary school
classes loaded on a single factor; for elementary school classes they
contributed to two correlated factors, appearing to be roughly
describable as pupil participation and controlled pupil activity.)


minimum of 
€assessme: 


Ormation to 
common mean and 
averaged to provi 


² It may be noted that none of these pupil behavior dimensions
directly reflects pupil learning of knowledge, understandings, and
skills, toward which much of the school’s efforts is directed. But
most school practices assume active attentiveness, purposeful and
systematic study, reasoned self-assurance, and an inquiring and
initiating attitude on the part of the pupil which are conditions
conducive to learning.




of each class of pupi i 
dimensions cheat cae in each of the bipolar 
A pupil behavior index, Po, for each class of pupils was obtained by
combining the assessments on all four pupil behavior dimensions,
equally weighted, into a single index. Combination of the several
dimensions into a single index seemed reasonable in light of their
intercorrelations, which, as noted above, suggested a single secondary
school cluster and two by no means distinct elementary clusters. It
was with this overall index, Po, that the inventory estimated teacher
characteristics subsequently were correlated.

The reliability of the pupil behavior index, the overall estimate
contributed to by the four dimensions, was moderate. In one analysis
(based on 98 classrooms) in which the pupil behavior dimension
assessments of two observers were separately summed and the resulting
Po values correlated, the interobserver correlations were .62, .84,
and .72 when the observations were made less than 30 minutes apart,
more than 30 minutes apart but within the same half-day, and during
the same day but different half-days, respectively; a fourth
coefficient was obtained for observations made during the same school
year but with an interval of more than one month.
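The interobserver check described above (sum each observer's four dimension assessments per class, then correlate the two sets of Po values) can be sketched as below. The assessments are invented for illustration and the function names are hypothetical; only the equal weighting and the use of a product-moment correlation come from the text.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def summed_index(assessments):
    """Equal-weight sum of the four dimension assessments for each class."""
    return [sum(row) for row in assessments]


# Hypothetical 1-7 assessments: 5 classes x 4 pupil behavior dimensions
# (Apathetic-Alert, Obstructive-Responsible, Uncertain-Confident,
# Dependent-Initiating), one list per observer.
first_observer = [[5, 6, 5, 4], [3, 3, 4, 2], [6, 6, 7, 6],
                  [4, 5, 4, 4], [2, 3, 2, 3]]
second_observer = [[5, 5, 6, 5], [4, 3, 3, 3], [6, 7, 6, 5],
                   [5, 4, 4, 3], [3, 2, 3, 2]]

po_first = summed_index(first_observer)
po_second = summed_index(second_observer)
interobserver_r = pearson_r(po_first, po_second)
```

With real data the same correlation would be computed separately for each observation interval, which is how the several coefficients reported in the text would arise.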


Teacher Characteristics Data


The Teacher Characteristics Schedule, employed to obtain estimates of
teacher “traits” described below, was an omnibus self-report type of
inventory made up of items selected from a number of specially
prepared separate instruments which had been subjected to a series of
preliminary and validation studies. In its final form the schedule
consisted of 300 multiple-choice and check-list items relating to
personal preferences, self-judgments, frequently engaged-in
activities, biographical data, and the like. Ten scores (the last one,
a score on a control variable used only to attempt to identify
individuals with a strong tendency to make “socially acceptable”
responses) were obtainable with the use of the Teacher Characteristics
Schedule. Responses to the schedule had been selected and
cross-validated (Ryans, 1960) to estimate teacher characteristics
which were identified as:

    warm, understanding, friendly vs. aloof, egocentric, restricted
      classroom behavior
    responsible, businesslike, organized vs. evading, unplanned,
      slipshod classroom behavior
    stimulating, imaginative vs. dull, routine classroom behavior
    favorable vs. unfavorable opinions of pupils
    favorable vs. unfavorable opinions of democratic classroom
      procedures
    favorable vs. unfavorable opinions of administrative and other
      school personnel
    learning centered (“traditional” or conservative) vs. child
      centered (“permissive” or liberal) educational viewpoints
    superior verbal understanding (comprehension) vs. poor verbal
      understanding
    emotional stability (adjustment or maturity) vs. instability
    validity vs. invalidity of response (a control variable)

Generally, the reliability coefficients (stability and equivalence
estimates) for scores derived from the Teacher Characteristics
Schedule fell between .70 and .80. Validity coefficients (correlations
between schedule scores and [a] observers’ assessments of teacher
classroom behavior, or [b] direct inquiry-type items relating to
certain constructs or traits, e.g., attitudes, verbal ability, etc.)
were of varying magnitude, depending upon the aspect of validity
investigated (cross-validity, validity generalization, validity
extension, concurrent validity, predictive validity), the particular
teacher characteristic estimated, and the teacher sample (e.g.,
elementary teachers, secondary teachers, mathematics-science teachers,
etc.) for which a particular scoring key was derived and to which it
might most appropriately be applied. Concurrent validity coefficients
typically ranged from .20 to .50 and cross-validity coefficients
between .40 and .60. Coefficients of predictive validity were low,
exceeding .20 only in two or three instances.


Sampling and Analysis 
Data from over 2,000 elementary and sec- 
ondary school classes, widely scattered geo- 
d and analyzed. 
lasses represented 
y ly large cities. 
Since cooperation of school systems, schools, and teachers was of
necessity voluntary, the sampling could not comply with the conditions
theoretically required for comparison of the data with known
probability models.

The teacher characteristics-pupil behavior data were analyzed
separately for seven subsamples, three from elementary schools and
four from secondary schools. The sizes of the subsamples varied, as
shown in Table 1 below, from 99 to 718 classes of pupils and teachers.
In each sample, and for each of the



TABLE 1

PRODUCT-MOMENT CORRELATION COEFFICIENTS BETWEEN “OBSERVER ASSESSED”
PUPIL BEHAVIOR AND CERTAIN “INVENTORY ESTIMATED” TEACHER
CHARACTERISTICS


“Inventory Estimated” | Elementary School Samples Secondary School Samples? 
Teacher Characteristic (with 
which ‘Observed’? Pupil H 
Behavior, Po,is compared) A B c D E F G 
iF 
Sex of teacher: Female | Male | M&F |Female| Male | m&F | M BF ast 
No. of classes: 718 116 144 510 441 iit ne 
Understanding teacher .32* .28* .31* | 08 l1* .07 -02 ag 
behavior 99" 
Businesslike teacher .36* .29* -31* | .15* Aue -00 -16 ‘ 
behavior 13 
Stimulating teacher .33* 27* .29* | 99+ -16* 20* 197) 
behavior 14 
Favorable attitude to- -18* 15 10 -03 -07 — .03 —.09 
ward pupils 8 
Favorable attitude -18* 14 -16* | .07 12* 02 | —.08 A 
toward democratic 
classroom 
Favorable attitude to- -16* -00 15 13* 11* 02 | —.11 06 
ward administrators . : — 
and colleagues 18 
Conservative educa- | — 14* | —.o4* | _ 0 = 14 | - 
tional viewpointse OF | 08 | — a7 “OL oe 
Verbal comprehension .07 —.06 Att —.03 3 
Emotional stability .13* 04 “16* e oa 3 a = . — .02 
Validity of response —.07 -00 .09 | 02 ‘og | — ‘01 02%] 4d 
* Asterisk used to note coefficients that differ significantly (.05
level or beyond) from .00.
a Samples A and C comprised of classes of Grades 1-6 pupils and their
teachers; Sample B, of classes of elementary grade pupils and their
teachers.
b Samples D, E, F, and G comprised of classes of secondary school
students and their teachers (roughly 22% mathematics classes, 25%
science classes, 30% English classes, and 23% social studies classes);
Sample H, of classes of foreign language students and their teachers.
c A high score on this characteristic indicates learning centered
viewpoints; a low score, liberal, child centered viewpoints. A
negative coefficient is therefore interpreted as indicative of a
positive relation between pupil behavior and child centered
viewpoints.


teacher characteristics studied, Po of each class was paired with the
teacher’s score on the characteristic under consideration, and the
product-moment correlation coefficient computed.


RESULTS


Elementary classes. Data from the three elementary school samples
suggest moderate, statistically significant (.05 level or beyond)
interdependency relationships between assessments of observed
purposeful and productive pupil behavior and inventory estimated
understanding, friendly teacher behavior; organized, businesslike
teacher behavior; and original, stimulating teacher behavior. Although
the coefficients are not consistently significant from sample to
sample, the evidence seems to suggest relationships between pupil
behavior and the inventory estimated teacher characteristics of
favorable attitude toward democratic classroom procedures and liberal
or child centered (permissive) educational viewpoints, and also
between pupil behavior and favorable attitude toward pupils and
emotional stability.


Samples A and C, involving female, and male and female teachers,
respectively, yield low positive correlations between pupil behavior
and the teacher’s favorable attitude toward administrators and
colleagues, but the coefficient for Sample B, male teachers, is of
zero order.

With the notable exception of verbal intelligence, all of the
correlations between inventory estimates of teacher characteristics
and observed pupil behavior are statistically significant in the large
sample of elementary classes taught by female teachers.

It seems likely that classroom pupil behavior is related to a number
of teacher characteristics of the kinds here estimated, when study is
restricted to elementary school classes.
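As a rough check on which coefficients of this size differ from .00 at the .05 level, a large-sample test based on Fisher's z transformation can be sketched as below. The paper does not say which significance test was used, so the method here is an assumption; the three coefficient and sample-size pairs are values that appear legible in Table 1, and the function name is hypothetical.

```python
from math import atanh, sqrt


def differs_from_zero(r, n_classes, critical=1.96):
    """Two-tailed large-sample test of H0: rho = 0.

    Fisher's z = atanh(r) is approximately normal with standard error
    1 / sqrt(n - 3); 1.96 is the two-tailed .05 normal critical value.
    Returns (is_significant, test_statistic).
    """
    z_stat = atanh(r) * sqrt(n_classes - 3)
    return abs(z_stat) > critical, z_stat


checks = {
    "r=.32, N=718": differs_from_zero(0.32, 718),
    "r=.28, N=116": differs_from_zero(0.28, 116),
    "r=.15, N=116": differs_from_zero(0.15, 116),
}
```

Under this approximation the first two coefficients exceed the critical value and the third does not, which agrees with the pattern of asterisks for those entries in Table 1 and shows why a coefficient of a given size can be significant in the large Sample A but not in the smaller samples.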
Secondary classes. The secondary school data suggest substantially
fewer statistically significant relationships between the assessments
of productive pupil behavior, as observed and assessed, and the
inventory estimated characteristics of teachers as measured by the
Teacher Characteristics Schedule. It would appear that the only
generalizable relationships are those represented by low correlations
between observed pupil behavior and inventory estimated original,
stimulating teacher behavior, and, perhaps, between the pupil behavior
index and the teacher characteristic interpreted as businesslike
classroom behavior.

Comparison of elementary and secondary school relationships. As had
been expected from previously obtained evidence of correlations
between assessments of directly observed pupil behavior and directly
observed teacher behavior, interdependency relationships between pupil
behavior and inventory estimated teacher characteristics were found to
be distinctly more apparent in elementary school classes and less
discernible in secondary school classes.

For purposes of comparison of the pupil behavior-teacher
characteristics correlations based on elementary vs. secondary school
classes, consideration of the three elementary samples together and
the five secondary samples together appeared justifiable. (For each of
the inventory estimated teacher characteristics, application of the
appropriate chi square test for the z_r transformations showed the
values of the correlation coefficients between observed pupil behavior
and the teacher characteristic under consideration to be no less
homogeneous than might be expected from random sampling among the
three elementary school samples and also among the five secondary
school samples.)
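One common form of the chi square test for homogeneity of correlations, applied to Fisher z_r values, is sketched below. The paper says only that "the appropriate chi square test" was used, so the exact formula (weights of n - 3, k - 1 degrees of freedom) is an assumption. The example uses the three elementary coefficients for understanding teacher behavior as they appear legible in Table 1; the function and constant names are hypothetical.

```python
from math import atanh

# Tabled .05 critical value of chi square with 2 degrees of freedom
# (three samples minus one).
CHI2_CRIT_DF2_05 = 5.991


def homogeneity_chi_square(rs, ns):
    """Chi square for homogeneity of k correlations.

    Each r is converted to z_r = atanh(r) and weighted by n - 3; the
    statistic is the weighted sum of squared deviations of the z_r values
    from their weighted mean. Returns (chi_square, weighted_mean_z).
    """
    zs = [atanh(r) for r in rs]
    ws = [n - 3 for n in ns]
    z_bar = sum(w * z for w, z in zip(ws, zs)) / sum(ws)
    chi2 = sum(w * (z - z_bar) ** 2 for w, z in zip(ws, zs))
    return chi2, z_bar


# Samples A, B, C: r between Po and understanding teacher behavior.
chi2, z_bar = homogeneity_chi_square([0.32, 0.28, 0.31], [718, 116, 144])
homogeneous = chi2 < CHI2_CRIT_DF2_05
```

A statistic well below the critical value, as here, is what licenses pooling the samples before comparing school levels.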

When the pupil behavior-teacher
characteristic correlation coefficients
of the three elementary samples are
translated into z_r values and combined
to obtain the mean z_r, and this mean
compared with the mean z_r value ob-
tained by combining the five second-
ary school samples, the hypothesis
that correlations between pupil be-
havior and teacher characteristics are
more visible in the elementary as com-
pared with the secondary school is
supported at least for the teacher
characteristics referring to warm, un-
derstanding, friendly behavior; or-
ganized, businesslike behavior; stimu-
lating, imaginative behavior; fa-
vorable opinions of pupils; favora-
ble opinions of democratic classroom
procedures; and child centered or per-
missive (liberal) educational view-
points. The mean elementary z_r value
was significantly higher (.05 level or
beyond; two-tailed test) than the
mean secondary z_r value for the in-
ventory estimated teacher characteris-
tics named. (Had the .10 level been
accepted as the criterion of signifi-
cance, the pupil behavior-teacher emo-
tional stability correlation also would
have been significantly higher for the
elementary compared with secondary
school classes.)
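
The comparison of mean z_r values can be illustrated with a short sketch. It assumes the conventional normal-deviate test for the difference between two pooled z_r values (weights of n - 3); the function names and any inputs are hypothetical and are not taken from the article.

```python
import math

def pooled_z(rs, ns):
    """Weighted mean z_r for a set of samples, plus the total weight."""
    weights = [n - 3 for n in ns]
    zs = [math.atanh(r) for r in rs]
    total = sum(weights)
    return sum(w * z for w, z in zip(weights, zs)) / total, total

def compare_mean_z(rs_a, ns_a, rs_b, ns_b):
    """Difference between two mean z_r values with a two-tailed normal test."""
    z_a, w_a = pooled_z(rs_a, ns_a)
    z_b, w_b = pooled_z(rs_b, ns_b)
    se = math.sqrt(1.0 / w_a + 1.0 / w_b)
    ratio = (z_a - z_b) / se
    # Two-tailed probability under the standard normal curve.
    p = math.erfc(abs(ratio) / math.sqrt(2.0))
    return z_a - z_b, ratio, p
```

A call such as `compare_mean_z(elementary_rs, elementary_ns, secondary_rs, secondary_ns)` returns the difference, the critical ratio, and the two-tailed probability that would be set against the .05 or .10 criterion.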

It may be noted
nificance of the differences between z_r
values for the elementary Sample A


were employed. When
and their classes were compared, ele-
mentary Sample B and secondary
Sample E, none of the differences
between z_r values attained the .05
significance level, although the dif-
ferences between the correlations in-
volving the understanding, friendly;
responsible, businesslike; and child
centered educational viewpoints on the
part of teachers would have been sig-
nificant had the .10 level been em-
ployed. Only two of the z_r differences
(those involving understanding,
friendly and responsible, businesslike
teacher characteristics) met the .05
criterion when the relatively smaller
independent samples of male and fe-
male teachers and their classes, Sam-
ples C and F, were compared; the dif-
ference of .23 between the elementary
and secondary pupil behavior-teacher
emotional stability z_r values was the
next largest difference, but this would
have been significant only had the .10
level been accepted.

Comparison of correlations involv-
ing male and female teachers. As a fi-
nal note, added in light of frequently
expressed interest in the sex of the


DAVID G. RYANS 


teacher as a variable in the teacher-
pupil relationship, it may be noted
that the pupil behavior-teacher char-
acteristics correlation coefficients ob-
tained from samples of classes of fe-
male teachers and those obtained from
samples of classes of male teachers
generally are in the same direction,
and although they vary slightly, the
variation does not exceed that which
might be attributed to random sam-
pling. (When the correlations for Sam-
ples A and B, and similarly those for
Samples D and E, were compared,
none of the differences in correlation
for men vs. women was significant
even at the .10 level.) The present data
do not suggest systematic differences
between men and women teachers in-
sofar as the relationship between the
teacher characteristics and pupil be-
havior estimated here is concerned.


Summary

It was hypothesized that (a) certain
teacher traits (characteristics) esti-
mated from scores yielded by a self-
report type inventory, the Teacher
Characteristics Schedule, would be
positively correlated with pupil be-
havior in the teachers' classes, and
(b) pupil behavior-teacher character-
istic relationships would be more
evident in elementary than in sec-
ondary school classes.


erive -vali- 
al acne ‘a d and cross 


Tesponseg ni 
Cc Jes ay, ee bea 
( 1 teterstieg Schedule, Pupil behav- 
rei Observed and assessed, 
» DY trained sence 
me. School data sugges 
erate ye ationshing betiesen af 
Productive p,-served Purposeful and 


i : 

TY esti Pil behavior and inven- 
identifie « acher characteristics 
zs as de: 5 T°: 

ehavior?. « tst; 


anding, friendly 
«.;_?. businesslike 
°niginal, stimulating 


| 


TEACHER CHARACTERISTICS AND PUPIL BEHAVIOR 97 


behavior"; and low-order relationships
between pupil behavior and teachers'
"favorable attitudes toward demo-
cratic classroom procedures," "child
centered or permissive (liberal) edu-
ee “favorable atti- 
Ky ee ek ematical 
Secondary school data suggest sub-
stantially fewer relationships between
assessments of productive pupil
behavior and inventory estimated
teacher characteristics. The only gen-
eralizable relationships seem to be
those between pupil behavior and in-
ventory estimated "original, stimulat-
ing teacher behavior," and, possibly,
between pupil behavior and inventory
estimated "organized, businesslike
teacher behavior."


Relationships between pupil behav- 
ior and inventory estimated teacher 
characteristics were less discernible in 
secondary school classes as compared 
with elementary school classes. 

No significant differences were ob- 
served between the pupil behavior- 
teacher characteristic correlations of 
men teachers as compared with women 


teachers. 
REFERENCES 


Ryans, D. G. Characteristics of teachers. 
Washington, D. C.: American Council 
on Education, 1960. 

Ryans, D. G. Some relationships between
pupil behavior and certain teacher char-
acteristics. J. educ. Psychol., 1961, 52,
82-90.
(Received September 6, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 2, 98-103 


RELATIONSHIP OF INTELLIGENCE TO STEP SIZE 
ON A TEACHING MACHINE PROGRAM 
CARLETON B. SHAY¹


University of California, Los Angeles 


Teaching machine programs have
generally been written with little con-
sideration for different levels of intel-
lectual ability. The programed teach-
ing method itself has been considered
sufficient to overcome ability differ-
ences, because most


been written with 
allowing progressio 


intelligent, it seems likely there is a 


ence on 


Porter's (1959) study with spelling words,
which found no relationship between
IQ and achievement, supports the con-
tention that one program is sufficient
for different ability levels; but results
of a study by Briggs and Besnard
(1956), using more complex materials,
suggest that programs suited to differ-
ent ability levels might be desirable.

1 This article is based on the author's doc-
toral dissertation at the University of Cali-
fornia, Los Angeles, under the direction of
bar eens the Santa Monica, California,
city schools.

The present study investigated the
hypothesis, in null form, that there is
no relationship between intelligence
and step size on a teaching machine
program for each of the following cri-
teria: total learning, learning of rote
materials, learning of materials in-
volving "understanding," errors, and
time to complete the program.
Size of item step was defined, after
Lumsdaine (1959), as the difficulty
of giving the correct answer, in-
ferred from measurement of the num-
ber of errors made on a program. The
proportion of errors on a program gives
a difficulty level from which an aver-
age probability of correct response
may be calculated thus:

\[
\bar{p}\{R\} = 1 - \frac{\sum_{i} \sum_{j} E_{ij}}{IJ}
\]

where

\bar{p}\{R\} = average probability of correct response R on items;
E_{ij} = number of errors made by the i-th individual on the j-th item;
\sum_{i} = sum of the I individuals; and
\sum_{j} = sum of the J items.

The probability of responding cor-
rectly to any given item is:

\[
p\{R_j\} = 1 - \frac{\sum_{i} E_{ij}}{I}
\]

Both th 
average j; 


adjust and a 
€ programs used 


METHOD


* Were written to cover 
1 = Ha roman numerals: 
2 The programs are available with the
author's dissertation (Shay, 1960) through
interlibrary loan. They are under separate
cover, however, and should be requested
specifically.


INTELLIGENCE AND TEACHING MACHINE PROGRAMS 


Thi i 
Beary presents the symbols and princi- 
399 a of roman numerals to 
1957), a er, Merton, & Grossnickle, 
Sirever Diet with few exceptions, 
mf lanes not venture beyond 100. 
A large step program was written first and
given to small (N ~ 4) samples
of above average (IQ ≥110) pupils, each
sample followed by item revision until the
average probability of correct response was
≥90% and most individual item probabili-
ties of correct response were ≥80%. This
large step program was similarly reduced in
step size with samples of average students
(IQ 90-109) to form a medium step pro-
gram; this, in turn, was revised to form a
small step program for below average pupils
(IQ ≤89). Step size was reduced by the in-
sertion of bridging items, splitting an item
into two simpler ones, or by reword-
ing stems or responses to provide more sup-
port to the slower student.

The three programs were next adminis-
tered to a sample of 90 students, an equal
number at each ability level and with each
program. The step size was then readjusted
to form the final experimental programs.
They contained 103 (large step), 150 (me-
dium step), and 199 (small step) items, and
were ring-bound to form
small booklets approximately 4" x 6" in size.
The first three items in each program were
used to instruct the response procedures and
were not included in the error count or
timing. With decreasing step size, programs
were less difficult and contained more re-
view items and more varied examples illus-
trating the principles of roman numerals.


Apparatus 


Tw - , 
ine oo, identical response devices were 
used. Each device consisted of a board faced
with aluminum foil upon which was clamped
an IBM answer sheet and an IBM
matrix faced with aluminum foil and
punched for the correct answers. The student
responded by punching through the answer
sheet with a stylus, completing an electrical
circuit to either a green or red light, in-
forming the subject of his success.


Subjects 

Ninety subjects were chosen from the
low fourth grade in four Los Angeles ele-
mentary schools on the basis of roman nu-
merals pretest and group intelligence test
scores. One additional subject was dropped


because of y and a second because of 




complete failure to follow directions. A third
of the subjects were above average in in-
telligence (IQ ≥110), one-third average (IQ
93-109), and one-third below average (IQ
≤92). Adjustment of the customary lower
limit of the average range was necessitated
by a lack of acceptable low ability students.


Pretest and Posttest 

The pretest consisted of 24 items, 12 to be
written in roman numerals from arabic, and
12 vice versa. They were all numbers from
1-39 except for L, C, D, M, 40, and 44. Only
the number § was common to both lists.

The posttest consisted of 31 items: 10
arabic numbers to be written in roman nu-
merals, 14 of the opposite type, and 7 from
a sequence which required counting by 100s
to 1,000 in roman numerals, with C, D, and
M given. The first two parts mentioned were
arranged with old items first, followed by
new numbers not taught in the programs.
The old items were those numerals specifi-
cally taught and presumably not requiring
an understanding of roman numerals con-
struction for recall. The term "rote," as ap-
plied to these items, refers only to the mini-
mal amount of understanding necessary for
learning and not to the method of presenta-
tion. In contrast, the new items had not been
taught and could be formulated only through
an understanding of these principles. Most
of the new items involved inversions (as in
4 and 9) and were numbers between 100 and
1,000. There were 12 old and 19 new items.
Reliability of the posttest, as estimated by
the application of Kuder-Richardson For-
mula 20 to the scores of the experimental
subjects, was as follows: total posttest, .93;
old items portion, .83; and new items por-
tion, .90.
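
For readers unfamiliar with Kuder-Richardson Formula 20, the reliability estimate mentioned above can be sketched as follows. This is an illustration only: it uses the standard formula with the population variance of total scores, the item scoring (1 = correct, 0 = incorrect) is assumed, and the tiny score matrix is hypothetical rather than the study's data.

```python
def kr20(scores):
    """Kuder-Richardson Formula 20 for dichotomously scored items.

    scores[i][j] is 1 if subject i answered item j correctly, else 0.
    """
    n_subjects = len(scores)
    n_items = len(scores[0])
    totals = [sum(row) for row in scores]
    mean_total = sum(totals) / n_subjects
    # Population variance of the total scores.
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_subjects
    # Sum over items of p * q, the item variances.
    sum_pq = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in scores) / n_subjects
        sum_pq += p * (1.0 - p)
    return (n_items / (n_items - 1)) * (1.0 - sum_pq / var_total)

# Hypothetical responses: four subjects, three items.
example = [
    [1, 1, 1],
    [0, 0, 0],
    [1, 1, 0],
    [1, 0, 0],
]
print(kr20(example))
```

Applied to the 31 posttest items and the 90 experimental subjects, a computation of this kind would yield the coefficient reported for the total posttest.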


Procedure 
ministered to all stu- 
) 4B class. Group test IQs were 
f students whose 
, and sub- 
Subjects
within each ability group were randomly
assigned to one of the three programs to
form nine experimental groups of 10 sub-
jects each. These groups, explicitly defined
in Table 4, were formed from each possible
pairing of ability group and program.

Subjects worked on the programs in
groups of 7 to 12, typically including mem-
bers from 5 or 6 of the experimental groups.
In the first experimental session for each
group, subjects were instructed in the re-
sponse procedures with the first three items.
Each subject completed 50 items a day on
consecutive school days until he finished his
program; this was followed immediately by
the posttest. Performance measures ob-
tained for each subject included old and
new items posttest and total posttest scores,
total errors, and time necessary to complete
the program.
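
The assignment of subjects to the nine experimental groups can be sketched as below. The article states only that assignment within each ability group was random; the shuffling procedure, the function name, and the subject identifiers are assumptions made for illustration.

```python
import random

def assign_groups(subjects_by_ability, programs, group_size, seed=None):
    """Randomly assign subjects within each ability level to programs.

    Returns a dict keyed by (ability, program) holding lists of subjects.
    """
    rng = random.Random(seed)
    groups = {}
    for ability, subjects in subjects_by_ability.items():
        shuffled = list(subjects)
        rng.shuffle(shuffled)
        for k, program in enumerate(programs):
            start = k * group_size
            groups[(ability, program)] = shuffled[start:start + group_size]
    return groups

# Hypothetical subject identifiers: 30 per ability level.
subjects = {
    level: [f"{level}-{n}" for n in range(30)]
    for level in ("above average", "average", "below average")
}
programs = ("large step", "medium step", "small step")
groups = assign_groups(subjects, programs, group_size=10, seed=1)
print(len(groups))
```

Each of the nine keys pairs one ability level with one program, matching the pairings laid out in Table 4.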


RESULTS 


Tables 1 and 2 indicate the extent 
to which the programs met the criteria 
of average and individual item diffi- 
culties for the experimental groups. 

Values in the lower-left to upper-
right hand diagonal in each table,
representing the criterion ability

TABLE 1

MEAN NUMBER OF ERRORS AND MEAN
PROBABILITY OF CORRECT RESPONSE

                        Ability Group
Program          Below               Above
                 Average   Average  Average
Large step
  M               36.6      18.9      9.4
  p{R_j}          63.4      81.1     90.6
Medium step
  M               34.0      15.3     15.0
  p{R_j}          76.9      89.6     89.8
Small step
  M               22.1      25.6     17.2
  p{R_j}          88.7      86.9     91.2

TABLE 2

NUMBER AND PERCENTAGE OF ITEMS WHOSE
CORRECT RESPONSE PROBABILITY EQUALS
OR EXCEEDS 80%

                        Ability Level
Program          Below               Above
                 Average   Average  Average
Large step
  N                40        73       90
  %               40.0      73.0     90.0
Medium step
  N                92       131      137
  %               62.6      89.1     93.2
Small step
  N               171       174      186
  %               87.2      88.8     94.9


group-program combinations, suggest
that the programs were adequate for
the study. Additional examination of
the location of items within each pro-
gram not meeting the individual item
difficulty criterion revealed a
tendency in the small step program for
these items to cluster. This trend was
not considered detrimental to the sub-
jects' responses or to the further in-
terpretation of data.

Desirability of covariance analyses
of criterion scores with pretest scores
as a control variable was indicated by
the following correlations: pretest-IQ,
.33; pretest-total posttest, .66; pretest-
new items posttest, .59; pretest-old
items posttest, .66; pretest-errors,
-.51; and pretest-time, -.11.

Covariance analyses of criterion
scores are summarized in Table 3. The
assumption of homogeneity of regres-
sion was tested in each case and found
to be satisfied. Inverse sine transfor-
mations of percentage error scores,
and logarithmic transformations of
time scores were used to satisfy the
assumption of homogeneity of vari-
ance. The hypothesis that there is no
relationship between intelligence and
size of item step could be rejected only
in the case of percentage error, where
an interaction was found (p < .05).
The main effect of ability level was
obtained for each of the other criterion
scores, but no interaction.

The mean criterion scores for
each experimental group on each of
the variables (new, old, and total
posttest, time, and errors) are given
in Table 4.
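
The two variance-stabilizing transformations named above can be sketched as follows. The article does not say which variant of each was used, so the choices here (arcsine of the square root of the proportion, in radians, and common logarithms) are assumptions, as are the function names.

```python
import math

def inverse_sine_transform(percentage_error):
    """Arcsine square-root transformation of a percentage (0 to 100)."""
    proportion = percentage_error / 100.0
    return math.asin(math.sqrt(proportion))

def log_transform(minutes):
    """Common logarithm of a time score in minutes."""
    return math.log10(minutes)

# Hypothetical scores for one subject.
print(inverse_sine_transform(25.0))
print(log_transform(100.0))
```

Both transformations compress the upper end of the scale, which is why they are commonly applied when group variances rise with group means.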


Discussion 


Failure to reject most of the null
hypotheses asserted in this study sup-
ports Skinner's position that it is not
necessary to provide more than one
program on the basis of differential
ability. Table 4 suggests that for all



TABLE 3

COVARIANCE ANALYSIS OF CRITERION SCORES USING A ROMAN NUMERALS
PRETEST AS A CONTROL

                               Mean Square
Source         df   Total     New      Old     Percentage
                    posttest  items    items   error       Time
Programs        2    66.18    28.21     8.13    501.50     1.93
Ability         2   382.61   160.01    48.56    180.25     1.36
Interaction     4    68.62    26.15    14.08    189.56      .02
Within cells   80    36.70    19.44     6.24     59.88      .07

                                    F
Source              Total     New      Old     Percentage
                    posttest  items    items   error       Time
Programs             1.80     1.45     1.30               27.21**
Ability             10.43**   8.23**   7.78**             19.17**
Interaction          1.87     1.35     2.26      3.17*      .22

* .05 level.
** .01 level.

TABLE 4

MEAN CRITERION SCORES AND STANDARD DEVIATIONS

Experimental group     Total       New items   Old items   Time         Percentage
                       posttest    posttest    posttest    (minutes)    error
Ability                 M    SD     M    SD     M    SD     M     SD     M     SD
  Program
Above Average
  Large step          19.2  5.7   10.3  4.9    8.8  1.3    49.0  12.7   10.5   .7
  Medium step         19.9  7.7   10.8  5.9    9.1  2.4    76.1  15.7    9.2  1.0
  Small step          20.9  7.1   12.5  4.4    8.5  2.9    90.0  19.3    7.8  1.5
Average
  Large step          12.9  7.4    6.5  5.0    6.4  3.4    75.6  14.7   17.7  1.2
  Medium step         18.3  6.2   10.0  4.8    8.3  1.9    90.6  17.5    9.0
  Small step          15.8  8.2    8.2  6.0    7.6  3.2   122.0  22.7   13.0  1.3
Below Average
  Large step          12.1  8.2    6.5  4.8    5.6  3.8    87.3  82.9   32.5  1.7
  Medium step          9.4  7.0    4.5  3.2    4.9  3.9   108.5  35.1   18.5  1.6
  Small step          16.4  8.3    8.4  5.3    8.0  3.5   135.5  42.5    9.3  1.2


levels of ability and for different types
of material, as size of item step de-
creases, there is an accompanying in-
crease in posttest score and time to
complete the program.

The question naturally arises, why
did the experiment fail to reveal a
relationship between intelligence and
size of step, assuming there is a rela-
tionship? One possible reason is that
the programs were inadequate. Per-
haps a 90% probability of obtaining
the correct response is too low. This is
suggested by the inverse relationship
between posttest score and step
size. An argument against this expla-
nation is the possibility that extrane-
ous factors produced a portion of the
error, and the "true" error rate was ac-
tually lower. Machine malfunction was
known to cause some error, which
varied according to the subject's re-
action to occasionally seeing both
lights at once. Also, some subjects ex-
perienced a certain confusion between an
answer and the number of the answer.
For example, if the question was
"What is 5 in roman numerals?" and
the correct answer appeared at Choice Posi-
tion 3, some subjects would answer at
Choice Position 5 rather than 3. (In
future experiments, lettered response
positions are recommended.)

A second possible reason for not
finding a relationship between intelli-
gence and step size is that IQ did not
serve as an adequate predictor of
learning ability in this study. Studies
generally show only a moderate rela-
tionship between learning ability and
IQ. Perhaps group intelligence tests
are less effective predictors of learning
ability where programed teaching se-
quences are involved than is the case
in the ordinary classroom. Since pro-
grams are carefully regulated in terms
of step size, reading level, sequencing,
etc., it may be that general intelligence
as it is usually measured plays a
smaller role in this type of learning
than in the classroom where these
variables are not controlled. An im-
plication of this is that other, perhaps
more specific measures of ability
should be used to predict learning
from programed teaching.

A third explanation is the possibility
that a "real" relationship was obscured
by the confounding of certain pretest
relationships and the experimental de-
sign. Considering the correlations be-
tween pretest-IQ and pretest-posttest,
and assuming that IQ also contributed
to the variance in posttest scores, the
use of pretest scores as a control vari-
able reduced the contribution of IQ
to the variance of the criterion vari-
ables. An implication of this explana-
tion is that to test the hypothesis
adequately, it would be necessary to
use some material such as an artificial
language, whose pretest scores would
be essentially equal and unrelated to
IQ. A further reason for not obtaining
significant interactions is the possi-
bility that 10 subjects per group were too
few. A final consideration deals with the
definition of step size. If step size had
been defined in terms of the
language or prior experience necessary
to make the transition from one step
to the next, it is probably more likely
that interactions would have been
found.
Now let us assume that in fact there
is no relationship between ability and
step size. This implies, as Skinner sug-
gests, that for programed teaching the
minimally-small step program is the
most appropriate for all levels of abil-
ity. This would further suggest that if
provision for individual differences is
to be made, it should be made on some
basis other than intelligence alone.
Such a provision is indicated by the
time data, perhaps. Comparing large
and small step programs, bright stu-
dents took almost twice the time to
improve their score 1.7 points, an
increase of about 9%. In some situations,
such a gain may not be worth the additional
expenditure of time, and the use of
more than one program would be in-
dicated.

It would be desirable to explore the
hypothesis of this study further, with
the following differences: more sub-
jects; easier programs; different learn-
ing material, such as an artificial lan-
guage; different age groups; and use of
different predictors of "learning abil-
ity" from general to task-specific.


Summary 
This study examined the hypothesis
that there is no relationship between
intelligence and size of item step on a
teaching machine program under
criterion conditions of total learning,
learning involving rote materials,
learning of material involving under-
standing, errors, and time to complete
the program. Step size was defined
as the difficulty of giving the correct
answer and was evaluated by two criteria
of error.

Three programed teaching sequences
of 103, 150, and 199 items were devel-
oped for fourth grade roman
numerals. Each was written for a given
level of ability, and step size adjusted
to stated criteria. Ninety fourth
graders were selected on the basis of
pretest and intelligence test scores.
At each of three ability levels, three
groups of 10 subjects were randomly
selected and each group assigned to
one of the programs, making nine ex-
perimental groups. Subjects completed
50 items a day on successive days until
program completion, which was im-
mediately followed by the posttest.

None of the tested null hypotheses
could be rejected except in the case of
percentage error score. The results,
within the definition of step size used,
indicate that if there is a relationship
between intelligence and step size, it
is a slight one. This would suggest
that different programs are not neces-
sary on the basis of ability alone. Fur-
ther studies were suggested.


REFERENCES 


Briggs, L. J., & Besnard, G. Experimental
procedures for increasing reinforced
practice in training air force mechanics
for an electronic system. In G. Finch &
F. Cameron (Eds.), Symposium on air
force human engineering, personnel,
and training research. NAS-NRC Publ.,
1956, No. 455. Pp. 48-58.

Brueckner, L., Merton, E., & Grossnickle,
F. The new learning numbers. New
York: Winston, 1957.

Lumsdaine, A. A. The development of teach-
ing machines and programmed self-in-
struction. Paper read at Conference on
Status of Research and Experimentation
on the Newer Instructional Media, Cen-
ter for Advanced Study in the Behav-
ioral Sciences, Stanford, November 1959.

Porter, D. Some effects of year long teach-
ing machine instruction. In E. Galanter
(Ed.), Automatic teaching: The state of
the art. New York: Wiley, 1959. Pp. 85-
90.

Shay, C. B. The relationship of intelligence
to size of item step on a teaching ma-
chine program. Unpublished doctoral
dissertation, University of California,
Los Angeles, 1960.

Skinner, B. F. Teaching machines. Science,
1958, 128, 969-977.


(Received August 9, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 2, 104-107 


TRANSFER OF VERBAL MATERIAL ACROSS SENSE 
MODALITIES 


PAUL PIMSLEUR AND ROBERT J. BONKOWSKI¹


University of California, Los Angeles 


One of the main arguments against 
the teaching of the spoken foreign lan- 
guage in high schools and colleges 
maintains that there is not enough 
time in the average course in which to 
teach both speaking and reading. 
However, some writers contend that 
students who learn first to speak the 
language will, within the normal 
course time, catch up to or perhaps 
even surpass in reading ability those 
who have been taught reading all 
along. The argument of those who es- 
pouse this view will be bolstered if it 
can be demonstrated experimentally 
that aural learning facilitates visual 
learning. 

Experiments bearing directly on the 
transfer of verbal material across the 
modalities of vision and audition are 
rather scarce in the literature of ex- 
perimental psychology. Weissman and 
Crockett (1957) demonstrated that 
transfer does occur from auditory 
training to visual discrimination. Post- 
man and Rosenzweig (1956) suggested 
that transfer from visual training to 
auditory discrimination is greater than 
conversely. However, in both of the 
above studies the investigators were
concerned with the thresholds of rec-
ognition of verbal material.

The present study addressed itself
to several questions raised by the
controversy over the teaching of lan-
guages: (a) Does aural learning facili-
tate visual learning? (b) Is this facili-
tation greater than that achieved by
presenting the material first visually
and then aurally? (c) In terms of
time, is it more economical to teach
by the aural-visual order than by the
visual-aural order (i.e., does the stu-
dent take fewer total trials to learn
verbal material both visually and au-
rally when the material is presented
first aurally and then visually)? Af-
firmative answers to these questions
would lend some support to those fa-
voring the aural approach to language
teaching.

3 eir grati-
ear te Nose. a ft paceing tag ok oe
ful suggestions and comments on a draft of
this paper.


METHOD

Subjects. The subjects were 28 paid
volunteers recruited from students
enrolled in second semester Spanish courses
(Spanish II) at UCLA. The group consisted
of 10 females and 18 males.

Design. A treatments X levels design
(Lindquist, 1956) was used.
The subjects were first divided into two
levels: those who had received a grade of
A or B in Spanish I at UCLA (14); and
those who had received a grade of C or D
in Spanish I at UCLA (14). At each level,
subjects were randomly assigned to one of
two conditions by the use of a table of ran-
dom numbers.

Procedure. In the present experiment there
were two treatments. The subjects learned
a list of paired associates through one mo-
dality, and relearned the same list through
another modality.
learning wer 
consecutive 
learned first 
of the list 


ane 


A 
Group V~ 


original learning scores of one group could
be compared with the relearning scores of
the other group as in a simple transfer
paradigm.

The list consisted of 10 nonsense dissyl-
lables, each paired with one of 10 color
names. The dissyllables, from Dunlap's
(1933) list, were chosen to meet require-
ments of ease of spelling, ease of recognition
aurally and visually, and similarity in con-
struction and pronounceability to English
words. The stimulus-response pairs are listed
in Table 1.

In the visual presentation the stimulus
word was projected on a screen for 1 second
and then removed. After a 4-second interval
the response word was projected for 1 sec-
ond. There was a 4-second interval between
pairs. In the aural presentation the pairs
were presented by tape recorder, with a 4-
second interval both between stimulus and
response and between pairs. Since a word
took about 1 second to pronounce, the time
intervals were comparable.

Using the anticipation method in each
condition the subject responded by writing
a response on a line of a record sheet using
a strip to cover previous re-
sponses. The 10 responses in each trial were
recorded on successive sheets in a booklet.
The 10 pairs were presented in different
random orders for successive trials in a con-
dition, but orders were the same between
conditions. Subjects were run in small groups
of two to four in order that the
experimenter could maintain control and
prevent cheating. The subjects were in-
structed to leave the room quietly after
having reached criterion in order to prevent
additional learning by subjects who finished
first. The relearning phase of a condition
was begun as soon as all subjects being run
at one time had reached criterion in the
original learning phase.

Subjects were instructed to learn the
"meanings of 10 foreign words." Virtually
identical instructions were given before
original learning and relearning so that sub-
jects did not know in advance that the "10
foreign words" and their "meanings" in the
relearning phase were the same as in the
original learning phase.


RESULTS AND DISCUSSION


The dependent variables were trials
and errors to criterion. The correla-
tions between these two measures were
.89 for the original learning phase and
.90 for the relearning phase. Statistical
tests based on both measures yielded
similar results. Therefore, only the
data for number of trials are reported.



TABLE 1

TEN PAIRS OF DISSYLLABLES
AND COLOR NAMES

polef   green      fimur   yellow
medon   purple     runil   brown
defig   red        kupod   white
nigat   pink       tarup   black
gokem   blue       latuk   orange

TABLE 2

MEAN TRIALS TO CRITERION IN ORIGINAL
LEARNING AND RELEARNING

                    Original Learning      Relearning
Spanish I Grade     Group     Group      Group     Group
                     V-A       A-V        V-A       A-V
A or B               8.86      6.86       2.71      2.29
C or D              14.57      9.14       4.14      2.57
Group Mean          11.71      8.00       3.43      2.48
Group SD             4.67      2.78       2.87       .62


The data for the analysis of amount
of transfer are summarized in Table 2.
Comparing the relearning scores for
each group with the original learning
scores of the other group suggested
that there was a marked positive
transfer across modalities. Aural origi-
nal learning facilitated visual relearn-
ing and the converse was also true.
The facilitation of visual learning by
prior aural learning may be viewed as
a finding in favor of the aural ap-
proach to language teaching.
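
The cross-group comparison just described can be illustrated with the group means of Table 2. The sketch below is only arithmetic on the reported means; it is not Stevens' savings score, whose formula is not reproduced in the article, and the function and variable names are hypothetical.

```python
# Group means from Table 2 (trials to criterion).
table2_means = {
    "V-A": {"original": 11.71, "relearning": 3.43},  # visual first, then aural
    "A-V": {"original": 8.00, "relearning": 2.48},   # aural first, then visual
}

def trials_saved(untrained_mean, transfer_mean):
    """Mean trials saved relative to learning the same modality with no prior training."""
    return untrained_mean - transfer_mean

# Visual learning: Group V-A learned visually from scratch,
# Group A-V relearned visually after aural learning.
visual_saved = trials_saved(
    table2_means["V-A"]["original"], table2_means["A-V"]["relearning"]
)

# Aural learning: Group A-V learned aurally from scratch,
# Group V-A relearned aurally after visual learning.
aural_saved = trials_saved(
    table2_means["A-V"]["original"], table2_means["V-A"]["relearning"]
)

print(visual_saved, aural_saved)
```

On these means, prior aural learning is associated with a larger reduction in visual trials than prior visual learning is in aural trials, which is the direction of the transfer the authors discuss.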

Stevens' (1951, p. 557) formula for
the savings score was applied in the
analysis of the differential degree of
transfer between the two conditions.
The mean savings scores for Groups
V-A and A-V were 89.5 and 94.6, re-
spectively. This showed a tendency for
subjects in the A-V condition to "save
more" than subjects in the V-A condi-
tion. However, the savings score is to
some extent dependent upon the cri-
terion set by the experimenter. There-
fore, an attempt was made to get a



TABLE 3

SUMMARY OF ANALYSES OF VARIANCE OF
ORIGINAL LEARNING AND OVERALL DATA

                          Original            Overall
                          Learning           (OL + RL)
Source             df     MS       F         MS       F
Sense Modalities    1    96.59    8.27**   155.64    5.86*
Grades              1   112.01    9.59**   165.20    5.29*
SM X G              1    20.56    1.76      36.65    1.38
Error              24    11.68              26.56

* Significant at .05 level.
** Significant at .01 level.


more meaningful analysis of the dif-
ferential degree of transfer.

In the original learning phase sub-
jects at both levels of Group V-A
appeared to have taken more trials to
criterion, on the average, than their
counterparts in Group A-V. Similarly,
looking at the performance of subjects
in each group suggested that the over-
all mean for Group V-A was greater
than that of Group A-V in the original
learning phase. An analysis of vari-
ance was employed to test the statis-
tical significance of these differences.
A summary of this analysis appears in
Table 3. The significant F for Sense
Modalities suggests that the visual
learning was more difficult than the
aural learning in the original phase.
This finding indirectly supports the
conclusion that there was more trans-
fer in the A-V condition than in the
V-A condition. For if one can agree
that the visual task was in fact more
difficult, one would not expect subjects
in the A-V condition to take a smaller
mean number of trials to criterion in
the relearning phase. Nevertheless,
subjects in the A-V condition obtained
a mean of 2.48 trials for the relearn-
ing phase as opposed to a mean of 3.43
trials for subjects in the V-A condi-
tion. It should be stressed that this is
offered only as indirect support of the
conclusion that there was greater
transfer in the A-V than in the V-A
condition.

The most significant finding, and
perhaps the strongest support for the
advocates of the aural approach to
language teaching, resulted from the
analysis of the overall (OL + RL)
scores. Combining the number of trials
for each subject in both phases of the
experiment yielded scores that gave
some indication of the effect of
the order of presentation of verbal ma-
terial. A summary of the analysis of
variance of these data is given in the
last column of Table 3.

The significant F for Sense Modali-
ties in this case suggests that subjects
in the A-V condition took on the aver-
age fewer trials to both criteria than
subjects in the V-A condition. Thus,
it took the students fewer total trials
to learn the verbal material both visu-
ally and aurally when the material
was presented first aurally and then
visually. This has the implication that
it might be more economical, in terms
of time, to teach verbal material by
the aural-visual order than by the
visual-aural order.

The significant F for Grades in both the original learning and the
overall data suggested that those subjects who achieved a grade of A
or B in Spanish I took fewer trials to criterion, on the average, than
those subjects who achieved a grade of C or D. In itself, such a result
might have been predicted by the experienced language teacher.
However, the Grades effect is of importance also in considering the
methodology of transfer studies similar to the present study. The
design used afforded a neat method of matching the groups as well as
increasing the efficiency of the experiment. In fact, had the groups
not been so matched, the estimate of the error mean square would have
been increased. The inflated error mean square would have reduced the
probability of obtaining a significant Sense Modalities effect.
On the basis of the results of this study, the investigators
concluded: aural learning does facilitate visual relearning; this
facilitation is greater than the facilitation of aural relearning by
visual original learning; and it takes a shorter time to teach verbal
material both visually and aurally when the material is presented
first aurally and then visually.


SUMMARY


A list of 10 paired associates (dissyllable stimuli and color names as
responses) was randomly presented first through one modality and then
through another modality. Half the subjects learned the list first
through the visual modality and then relearned it through the auditory
modality. The other half learned the list in the opposite order.
Transfer was found in both directions. It was suggested that the aural
presentation had a greater facilitating effect upon the visual
presentation than conversely. The subjects took fewer total trials to
learn verbal material both visually and aurally when the material was
presented first aurally and then visually. These findings seemingly
offer some support for the view that aural instruction preceding
visual instruction may have an advantage over conventional methods of
language teaching if the goal is to achieve proficiency in both
reading and aural comprehension.


REFERENCES 

Dunlap, K. List of 43,200 dissyllable words
and paralogs. Washington, D. C.: National
Research Council, 1933.

Lindquist, E. F. Design and analysis of
experiments in psychology and education.
Boston: Houghton Mifflin, 1956.

Postman, L., & Rosenzweig, M. R. Practice
and transfer in the visual and auditory
recognition of verbal stimuli. Amer. J.
Psychol., 1956, 69, 209-226.

Stevens, S. S. Handbook of experimental
psychology. New York: Wiley, 1951.

Weissman, S. L., & Crockett, W. H. Intersensory
transfer in the visual and auditory
recognition of verbal stimuli. Amer.
J. Psychol., 1957, 70, 283-285.

(Received August 29, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 2, 108-111


VERBAL CONCEPT LEARNING IN HIGH SCHOOL
STUDENTS WITH PICTORIAL AND VERBAL
REPRESENTATION OF STIMULI

WILLARD N. RUNQUIST AND VALORIE H. HUTT


Pomona College 


Within an association framework, concept formation has been conceived
of as the process by which a group of stimuli come to consistently
elicit the same response. Obviously, one important factor determining
the rate of concept formation, then, would be the ease with which the
stimuli elicit the common response.

In order to study the formation of verbal concepts, Underwood and
Richardson (1956a) scaled a series of common nouns in terms of their
dominant descriptive associations. Concept materials were then formed
by selecting groups of nouns having the same dominant association.
These were presented to subjects who had to discover and learn the
correct associations. Further studies (Freedman & Mednick, 1958;
Underwood, 1957; Underwood & Richardson, 1956b) utilizing these
materials have explored several variables affecting the rate of
concept learning.

In all of the studies cited above, the stimuli (the nouns) were all
presented verbally (i.e., in terms of the word representing the
object). It has been suggested (Underwood, 1952) that other modes of
presentation might have some effect on the rate of concept learning of
the type described above. The purpose of the present study was to
compare the learning of verbal concepts with pictorial representation
of stimuli with the more standard verbal representation.

It would seem that a case might be made for the superiority of either
condition. Verbal stimuli might be easier because irrelevant parts of
the pictures could interfere with the production or discovery of the
correct responses. On the other hand, with pictorial representation,
the correct association (a descriptive adjective) could be made quite
apparent from the nature of the picture, while in the case of the
verbal representation, the subject is free to “picture” the object in
any number of possible ways and may picture it in such a way as to
interfere with the discovery of the correct response.

In order to provide data relevant to these hypotheses, three
conditions were utilized in the present study: Verbal (the name of the
object), Picture Dominant (with the correct association emphasized by
the picture), and Picture Nondominant (the correct association
de-emphasized).

Grade level was also incorporated as one variable in the design, as it
is desirable to be able to extend and generalize any results to
include several educational levels if possible.


METHOD

Materials. The concepts and stimuli were chosen directly from
Underwood and Richardson's (1956a) materials [...]. The second and
third sets consisted of drawings of the objects. In one set, the
descriptive characteristic which was the correct response was
emphasized by the drawing (Picture Dominant). In the other set, this
was de-emphasized (Picture Nondominant). For example, a bed was drawn
as appearing soft and billowy for the Picture Dominant set and looking
like an army cot for the Picture Nondominant set.

Subjects. Sixty high school students (15 at each of four grade levels:
ninth, tenth, eleventh, and twelfth) served as subjects in this
experiment. Three groups of 20 subjects each were formed, with 5
subjects from each grade level in each of the three groups. No attempt
was made to match groups, with subjects being assigned in order of
appearance. The [...] of all students ranged from [...]. Each group
learned one set of the concept materials.

Procedure. Each subject was run individually, seated at a table facing
the experimenter. Stimuli were exposed manually for 3 seconds with a
2-second interval between them. Intervals were timed by an electronic
timer with an audible click. Instructions were essentially the same as
those used by Underwood and Richardson (1956b). The subject was told
to respond during the 3-second interval with descriptive words, i.e.,
those describing size, shape, texture, etc. The instructions were not
completely standardized as it was necessary that each subject
understood the task. One example (Elephant-big) was used in which
examples of descriptive words were given. The subjects were also told
that only one “response” was correct, and that the whole list took
only four responses. The experimenter said “right” following all
correct responses and said nothing if the subject had not responded
correctly within 3 seconds. Each subject received 15 repetitions of
the list, with the cards being shuffled after each repetition.


RESULTS


The mean number of correct responses for each group over all 15 trials
is shown in Figure 1. Analysis of variance of these data showed that
both mode of presentation and grade level were significant (F for mode
of presentation = 15.26; for grade level [...]). Individual t tests
showed the verbal instances produced better performance than the
picture emphasized instances (t = 2.67, p < .01) and the picture
emphasized instances better performance than the pictures
de-emphasized (t = 2.54, p < .02). As to grade level, the only
significant difference was between Grades 11 and 12. There was no
interaction between mode of presentation and grade level (F = 1.29),
even though the graph suggests that the differences were larger at the
lower grade levels.


TABLE 1
CONCEPT MATERIALS USED

Concept    Instances
round      baseball, doughnut, barrel, spool
soft       bed, fur, pillow, moccasin
sharp      fang, fishhook, hatchet, knife
slimy      lizard, earthworm, oyster, snake


¹ Subjects for this study were provided by Azusa High School, Azusa,
California; Nelson Price, Principal.


DISCUSSION


The most important finding of this study is that performance in
concept learning was better when the stimuli were in the form of words
than when they were represented pictorially. There are several
interpretations, two of which seem likely. First, the subject is using
the same medium as the stimulus in making his response, i.e., he is
looking at a word and responding with a word. Since the words used
here are common ones, it seems reasonable to assume that the concept
and stimuli have been associated during the subject’s prior
experience. Hence, in emitting the correct response, the subject may
not “form an image” at all, but merely responds with a highly likely
verbal associate. Karwoski, Gramlich, and Arnott (1944) came to
somewhat the same conclusion from their data which show a longer
reaction time to objects than to words.


Fig. 1. Mean correct responses as a function of grade level for
Verbal, Picture Dominant, and Picture Nondominant instances.
(Ordinate: total correct responses; abscissa: grade, 9 through 12.)


The interpretation which seems most plausible, however, attributes
these findings to the particular concepts which were used. Such
qualities as “soft,” “sharp,” and “slimy” are primarily tactual rather
than visual. It is very likely, then, that pictorial representation of
these objects suggested visual associations such as color, form, etc.,
or at least interfered with the production of an association of a
nonvisual nature. One test of this might have been to determine
whether the “wrong” associations or “first” associations given to the
instances of these concepts were of this nature. However, the
mechanics of administering the task precluded recording these
responses. Certainly, the hypotheses advanced here would seemingly
predict that Underwood-Richardson Dominance Values for the pictures
would be different from those obtained with words. This, however,
would not be crucial since the Dominance Values do not necessarily
represent the relative strength of responses within a single
individual, but merely the percentage of the subjects who gave a
particular response as the first association. Interfering response
tendencies might not be manifested in these Dominance Values, but
could appear as lengthened response time.

This interpretation suggests that broad generalizations regarding the
relative effectiveness of pictures, objects, and words as materials
for concept formation will in all probability not be found. As is the
case in most complex learning situations, the effect of a particular
variable will probably be found to depend upon the specific nature of
concepts to be learned.

Mention perhaps should be made of the fact that these results are
contrary to those obtained in a paired-associate study by Wimer and
Lambert (1959), who found better learning with object as opposed to
word stimuli and attribute these results to the greater similarity of
the verbal stimuli. In reconciling the present findings with these
results, two things might be pointed out: First, the similarity of the
materials used here might differ from that in the materials used by
Wimer and Lambert. They used actual objects, not pictures. Second,
similarity does not act the same in concept formation studies as in
paired-associate learning and, in fact, similarity between instances
of the same concept should actually facilitate acquisition
(Richardson, 1958). Thus, there is probably no incompatibility of the
present findings with those of Wimer and Lambert.

The finding that the more advanced students performed better on this
task probably may be attributed to increasing mental age. There is, in
addition, some suggestion that the differences between methods of
presentation were not as marked in the twelfth graders as in the other
groups. Although, statistically speaking, the conservative conclusion
is that the trend is unreliable, its regularity suggests a real
phenomenon. It could have been caused by something in the nature of
the associations elicited by the stimuli, or perhaps by a more
sophisticated approach on the part of the more advanced students. At
any rate, any explanation for this somewhat questionable phenomenon
must await further exploration.


SUMMARY


The formation of verbal concepts was investigated in high school
students utilizing instances consisting of words, pictures of objects
with the correct concept accentuated, or pictures of objects with the
correct concept de-emphasized. Four different grade levels were
combined with the three modes of presentation of the concepts in a
factorial design. The results showed that concept learning was best
when the instances were presented as words, followed by the picture
stimuli with the concept accentuated. The results were attributed to
the nature of the specific concepts studied, and it was concluded that
in all likelihood no simple answer to the question of which type of
presentation of instances is superior can be obtained.


REFERENCES 


Freedman, J. L., & Mednick, S. A. Ease of
attainment of concepts as a function of
response dominance variance. J. exp.
Psychol., 1958, 55, 463-466.

Karwoski, T., Gramlich, F. W., & Arnott,
P. Psychological studies in semantics:
I. Free association reactions to words,
drawings, and objects. J. soc. Psychol.,
1944, 20, 233-247.

Richardson, J. The relationship of stimulus
similarity and number of responses. J.
exp. Psychol., 1958, 56, 478-484.

Underwood, B. J. An orientation for research
on thinking. Psychol. Rev., 1952, 59,
209-220.

Underwood, B. J. Studies of distributed
practice: XV. Verbal concept learning
as a function of intralist interference.
J. exp. Psychol., 1957, 54, 33-40.

Underwood, B. J., & Richardson, J. Some
verbal materials for the study of concept
formation. Psychol. Bull., 1956, 53,
84-95. (a)

Underwood, B. J., & Richardson, J. Verbal
concept learning as a function of
instructions and dominance level. J. exp.
Psychol., 1956, 51, 229-238. (b)

Wimer, R., & Lambert, W. E. The differential
effects of word and object stimuli on the
learning of paired associates. J. exp.
Psychol., 1959, 57, 31-36.


(Received September 26, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 2, 112-115 


GRADES AS REINFORCING CONTINGENCIES 
AND ATTITUDE CHANGE 


ROBERT N. BOSTROM,¹ JOHN W. VLANDIS,² AND MILTON E. ROSENBAUM


State University of Iowa 


Although the use of marks and 
grades in educational practice has re- 
ceived considerable attention in the 
research literature, little if any infor- 
mation is available about the effects 
of grades on the students to whom 
they are given (Odell, 1950). The 
present study was designed to examine 
the effect on attitudes of differential 
assignment of grades for performance 
on attitude related essays. 

The basic hypothesis guiding the 
study was that grades will serve to af- 
fect behavior in the form of a rein- 
forcing contingency. More specifically, 
a “good” grade should serve to effect 
repetition of the responses which it 
followed, while a “poor” grade should
produce reduction in the potentiality 
of appearance of the preceding re- 
sponses. 

In the present study, students responded to attitude scales and
several weeks later were requested to write essays on topics related
to these attitudes, adopting a position incongruent with their
measured attitudes. At random, grades were assigned to these essays.
It was predicted that students who received an A for their essays
should change more in the direction of their essays on a subsequent
attitude measure than a group that received a D.
In applying the hypothesis of the reinforcing effects of grades to
attitude changes in the present context, the formulation presented by
Doob (1947) is being followed. Doob views an attitude as an implicit,
anticipatory response which mediates overt behavior but which in turn
is derived from the reinforcement of overt behavior. For Doob, reward
or avoidance of punishment may serve as the reinforcing contingency.
Accordingly, the reinforcement of attitude related overt statements
may be expected to be functionally related to changes in measured
attitude. Evidence supporting this view has been presented by Scott
(1957, 1959a, 1959b). The present study attempts to extend Doob’s
formulation into the area of educational practices.

Scott has proposed the possibility that verbalization of a position
opposed to initial attitude in and of itself may produce change, with
reinforcement or nonreinforcement leading to stability or extinction
of the new response. Janis and King (1954) showed that verbalization
alone produced change. The relative contribution of verbalization
alone in contrast to the effect of the consequent contingencies could
not be examined by Scott because of the absence of a control group
which experiences no consequences of their verbalizations. In the
present experiment, a group was included that received no grade
following their essays.


¹ Now in the Department of Speech, [...] College.

² Now in the Department of Speech, University of Connecticut.


METHOD

A 40-item questionnaire containing four 10-item attitude scales was
administered to students enrolled in Communication Skills classes at
the State University of Iowa. Subjects responded to each item on a
five-point scale ranging from strongly agree to strongly disagree.
Attitude scales dealt with federal aid to education, legalized
gambling, capital punishment, and socialized medicine. Responses to
the scale on federal aid to education were extremely skewed and the
capital punishment scale proved to be unreliable. These two scales
were discarded from further analysis and served primarily as filler
items. The initial administration of the questionnaire was conducted
by the class instructors.

During a class period, approximately 6 weeks following administration
of the scales, the subjects were asked by the experimenter to write
essays on particular assigned topics. Each subject was instructed by
directions appearing at the top of a sheet to write a brief essay
supporting a position on either legalized gambling or socialized
medicine. Scores on the attitude scales determined the position that
was assigned. In each case, the subject was instructed to write
supporting the position opposite to that indicated by his attitude
scale score, i.e., if the subject’s scale score indicated favorability
to legalized gambling, he was asked to write in opposition to
legalized gambling. The designation of topic for a particular subject
was based on the strength of his initial position on the scales. The
scale chosen was the one on which the subject had assumed the
strongest position relative to the other scales. One-half hour was
permitted for the essay. On completion of the essays, the experimenter
promised to return grades on the following day.

On a random basis, grades were assigned to the essays. One third of
the subjects writing on each topic received a grade of A, one third
received a grade of D, and one third was assigned no grade. This last
group was told when essays were returned that the ratings were
incomplete due to insufficient time. Immediately after returning the
essays and grades, the attitude questionnaire was readministered.
Finally, subjects were asked to indicate their satisfaction with the
essays. The total number of subjects participating in all phases of
the study numbered 127, of whom [...] wrote an essay on legalized
gambling and 69 wrote on socialized medicine.

Scores for each 10-item attitude scale were computed by summing the
responses to each item, for which a weight from 1 to 5 was given. The
total minimum possible score for each scale was 10 and the maximum
possible score was 50. The dependent change score was derived by
subtracting each subject’s score on the posttest from his score on the
pretest. A change in the direction of the position taken on the essay
was given a positive sign. Change in the opposite direction was
negative. To eliminate negative numbers a constant of 20 was added to
all change scores.
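
The scoring rules described above can be sketched in a few lines. This is only an illustration under stated assumptions: that a higher scale score means a more favorable attitude, and that the direction of each subject's essay is known. The function names and the `essay_favors_issue` flag are hypothetical and do not come from the study.

```python
from typing import Sequence

CONSTANT = 20  # constant added to all change scores to eliminate negative numbers


def scale_score(item_weights: Sequence[int]) -> int:
    """Sum ten item weights (each 1 to 5) into a scale score between 10 and 50."""
    if len(item_weights) != 10:
        raise ValueError("each attitude scale has 10 items")
    if any(w < 1 or w > 5 for w in item_weights):
        raise ValueError("item weights must lie between 1 and 5")
    return sum(item_weights)


def change_score(pretest: int, posttest: int, essay_favors_issue: bool) -> int:
    """Pretest-to-posttest shift, signed positive toward the essay's position, plus 20.

    Assumption (not stated in the text): higher scores mean greater favorability,
    so an essay opposing the issue should push the score downward.
    """
    shift = posttest - pretest
    signed = shift if essay_favors_issue else -shift
    return signed + CONSTANT


if __name__ == "__main__":
    # Hypothetical subject who was favorable at pretest and wrote in opposition.
    pre = scale_score([4] * 10)
    post = scale_score([3] * 10)
    print(change_score(pre, post, essay_favors_issue=False))
```

Because of the added constant, a reported mean change has to be read against a baseline of 20, which represents no change.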
RESULTS 


It was predicted that subjects who were awarded an A would change on
the average in the direction of their essays to a greater extent than
subjects who were given a D. Table 1 presents the data relevant to
this prediction. Subjects receiving an A changed an average of 31.76
points in the direction of their essays while subjects who were given
a D changed 25.85 points (both means include the added constant of
20). This difference is significant beyond the .01 level. Comparisons
of the groups that received a grade with the group that did not
indicate significantly greater change (p < .05) for the subjects who
received an A than for those given no grade, while no difference is
suggested between the subjects obtaining a D and subjects receiving no
grade.

TABLE 1
MEAN ATTITUDE CHANGE

Changeᵃ       A        D        No grade
Mean          31.76    25.85    27.11
SD            11.16     7.84     8.03

Differences between groupsᵇ
              t        p
A vs. D       2.84     <.01
A vs. No      2.13     <.05
D vs. No      0.73     >.10

ᵃ A constant of 20 was added to all change scores.
ᵇ Because of heterogeneity of variance, t tests were computed
employing the procedure recommended by Edwards (1960).


Analysis of mean change for each of the issues suggests similar
results to those obtained in the overall analysis, although the
differences obtained were more striking for the socialized medicine
issue than for legalized gambling. An analysis of mean change in
relation to initial position indicates that those who had initially
assumed a favorable position on each of the issues (i.e., in favor of
legalized gambling or socialized medicine) changed significantly more
(t = 4.73, p < .01) than those who were unfavorable. Nevertheless, the
effect of grades on attitude change remains similar to that obtained
when direction of initial position is not considered.
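
Table 1 notes that the t tests allowed for heterogeneity of variance but does not spell out the computation. As a rough illustration only, the sketch below computes Welch's unequal-variance t statistic from group means, standard deviations, and sizes. Welch's statistic is one common way of handling unequal variances and is not necessarily the procedure the authors followed. The function name `welch_t` and all numbers in the usage lines are hypothetical; the table does not report the number of subjects per group.

```python
import math
from typing import Tuple


def welch_t(mean1: float, sd1: float, n1: int,
            mean2: float, sd2: float, n2: int) -> Tuple[float, float]:
    """Return (t, approximate df) for two independent groups without pooling variances."""
    v1 = sd1 ** 2 / n1  # squared standard error of the first mean
    v2 = sd2 ** 2 / n2  # squared standard error of the second mean
    t = (mean1 - mean2) / math.sqrt(v1 + v2)
    # Welch-Satterthwaite approximation to the degrees of freedom
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df


if __name__ == "__main__":
    # Hypothetical summary statistics, not the study's data.
    t, df = welch_t(mean1=30.0, sd1=10.0, n1=25, mean2=25.0, sd2=5.0, n2=25)
    print(f"t = {t:.2f}, df = {df:.1f}")
```

When the two variances differ, the approximate degrees of freedom fall below the pooled value of n1 + n2 - 2, which makes the test more conservative.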
Subjects, in addition to responding to the attitude scale related to
the topic on which they had written an essay, also responded to the
scale for which a related essay had not been written. By comparing the
change scores of subjects who had written an essay on a particular
topic with those who had not written on that topic it is possible to
evaluate the effect of essay writing independent of the effect of
grades.³ The mean change obtained for subjects who had written a
relevant essay was 28.24 in contrast to 22.61 for subjects who had
written on the other topic. This difference is significant (t = 3.27,
p < .01), suggesting that the writing of an essay, independent of
grade received, produced change in attitude.


DISCUSSION


The results suggest support for the hypothesis that a “good” grade
serves to reinforce the behavior for which it has been administered.
Verbalization without a consequent contingency seems to lead to
responses similar to those obtained when verbalization is followed by
a “bad” grade. “Cognitive contact” with the opposing side in and of
itself does not appear to produce effects approaching those obtained
when reward is forthcoming.


³ Comparison of the scores of each of the experimental groups
individually with this nonessay writing “control” group would seem to
be precluded because of potential unknown effects of simply receiving
a grade. Collapsing across groups regardless of experimental condition
potentially randomizes such effects.

Some insight into additional effects of the grades is provided by the
results associated with a question as to the satisfaction the subjects
experienced with their essays. Of the 52 subjects who received an A,
42 expressed satisfaction with their essays. [...] of the 57 subjects
who received a D were satisfied with their essays. Of the subjects who
received no grade, [...] out of 57 were satisfied. A chi square of
41.99 (df = 2) indicated that this distribution is significant beyond
the .001 level. It is apparent that no difference in satisfaction with
their essays appears for those who received a poor grade and those who
were given no grade. Both events appear in this instance to be
functionally equivalent in their effects in contrast to the effect of
a good grade. The no grade condition was an unusual one in class
practice and seemed to operate for the subjects as a poor grade,
perhaps due to the frustration of failure for the expectancy of
receiving some grade to be fulfilled.
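
A chi square of the kind reported above is computed from a table of counts with one row per grade condition and one column each for satisfied and not satisfied. The sketch below shows the standard Pearson computation. The function names are illustrative, and the counts in the usage lines are hypothetical placeholders rather than the study's data, since two of the reported counts are not legible in this copy.

```python
from typing import Sequence


def chi_square(table: Sequence[Sequence[float]]) -> float:
    """Pearson chi square for an r x c table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if satisfaction were unrelated to grade condition
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic


def degrees_of_freedom(table: Sequence[Sequence[float]]) -> int:
    """(rows - 1) * (columns - 1)."""
    return (len(table) - 1) * (len(table[0]) - 1)


if __name__ == "__main__":
    # Hypothetical counts: rows are A, D, no grade; columns are satisfied, not satisfied.
    hypothetical = [[30, 10], [12, 28], [14, 26]]
    print(chi_square(hypothetical), degrees_of_freedom(hypothetical))
```

With three grade conditions and two response categories the table has (3 - 1)(2 - 1) = 2 degrees of freedom, which matches the df reported in the text.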
Some evidence is provided that verbalization in an incongruent essay
is effective in producing attitude change independent of consequences.
This supports the findings of Janis and King (1954) on the relation of
role playing and attitude change.

The experiment presents evidence only on the effect of grades on one
aspect of a complex set of behaviors appearing in the essay writing
situation. The qualified generalization can be offered that the
administered grades affected in a similar manner many other unmeasured
aspects of performance in the situation, i.e., compositional skills,
ideational patterns, affect concerning essay writing, etc. Further
research is necessary to delineate those behaviors affected in grading
situations. Educational practice will profit from an analysis of the
functional role of grades for other than descriptive or assessment
procedures.


SUMMARY


The potential effects of grades as a reinforcing contingency were
examined. College students wrote essays supporting positions on
attitude related issues contrary to their previously assessed
attitudes. Good and poor grades were randomly assigned to the essays
and returned to the students. The effects of these procedures on
attitude change were evaluated and good grades were demonstrated to
serve a reinforcing role in contrast to the effect of a poor grade or
no grade.


REFERENCES 


Doob, L. W. The behavior of attitudes.
Psychol. Rev., 1947, 54, 135-156.

Edwards, A. L. Experimental design in
psychological research. (Rev. ed.) New
York: Rinehart, 1960.

Janis, I. L., & King, B. T. The influence of
role playing on opinion change. J.
abnorm. soc. Psychol., 1954, 49, 211-218.

Odell, C. W. Marks and marking systems.
In W. S. Monroe (Ed.), Encyclopedia of
educational research. (Rev. ed.) New
York: Macmillan, 1950. Pp. 711-717.

Scott, W. A. Attitude change through reward
of verbal behavior. J. abnorm. soc.
Psychol., 1957, 55, 72-75.

Scott, W. A. Attitude change by response
reinforcement: Replication and extension.
Sociometry, 1959, 22, 328-335. (a)

Scott, W. A. Cognitive consistency, response
reinforcement, and attitude change.
Sociometry, 1959, 22, 219-229. (b)


(Received October 13, 1960) 




JOURNAL OF 
EDUCATIONAL PSYCHOLOGY 


Volume 52 


Number 3 


June 1961


EFFECTS OF EXTENDED EFFORT IN CREATIVE
PROBLEM SOLVING¹


SIDNEY J. PARNES²


University of Buffalo


Several studies have demonstrated the possibility of the deliberate
development of individual creativity through educational programs
([...], 1958, 1960; Taylor, 1959). Two of these evaluated the creative
problem solving course (Parnes, 1959) offered at the University of
Buffalo since 1949 (Meadow & Parnes, 1959; Parnes & Meadow, 1960). In
the creative problem solving course a variety of principles and
procedures are used to develop creative ability. Two studies have
demonstrated the effectiveness of the particular brainstorming
principle (Osborn, 1957) of deferred judgment during idea production
(Meadow, Parnes, & Reese, 1959; Parnes & Meadow, 1959). Taylor studied
the use of the principle among groups vs. individuals (Taylor, Berry,
& Block, 1957).

One result of the Parnes and Meadow (1959) study indicated a positive
correlation between quantity and quality of ideas. The relationship
between quantity and quality of ideas has been of great interest in
the study of creative problem solving activities. One principle
stressed in the course is that copious idea production leads to more
creativity in problem solving. Osborn and others have emphasized this
need for fluency of ideas. Instructors have reported a prevalence of
better ideas at the end of an idea production period than at the
beginning (Parnes, 1959). Previous experimenters found increases in
remoteness of response and uncommonness of response among succeeding
responses on creative tasks (Christensen, Guilford, & Wilson, 1957).


¹ This research was financed by a grant from the Creative Education
Foundation.

² The author is indebted to Hayne Reese for his constructive review of
this paper, as well as his assistance with the statistical analysis.

The present study involved two ex- 
periments designed to test the princi- 
ple that extended effort in idea pro- 
duction will lead to an increasing 
proportion of good ideas with in- 
creased production. 


Experiment I 


The hypothesis was formulated in Experiment I that more good ideas
will appear in the last half of a subject’s total idea output
regarding a creative thinking problem than during the first half. It
was also decided to test for any difference between high and low
producers in the proportions of their good ideas that fall in the
second half.
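
The comparison named in the hypothesis can be illustrated with a short sketch that counts good ideas in the first and last halves of one subject's ordered output. The function name and the handling of an odd number of responses (the middle response is left out of both halves) are assumptions made for illustration; the text up to this point does not say how odd totals were treated.

```python
from typing import Sequence, Tuple


def good_ideas_by_half(is_good: Sequence[bool]) -> Tuple[int, int]:
    """Count good ideas in the first and last halves of an ordered list of responses.

    Assumption: with an odd number of responses the middle one is ignored,
    so both halves always contain the same number of responses.
    """
    half = len(is_good) // 2
    first_half = sum(1 for flag in is_good[:half] if flag)
    last_half = sum(1 for flag in is_good[len(is_good) - half:] if flag)
    return first_half, last_half


if __name__ == "__main__":
    # Hypothetical subject with seven responses, in the order produced.
    ratings = [False, True, False, False, True, True, False]
    print(good_ideas_by_half(ratings))
```

Counting within equal-sized halves keeps the comparison from being distorted by one half simply containing more responses than the other.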




Method 


Subjects. Data were available from two 
groups of subjects. Group 1 consisted of 68 
students registered (but uninstructed) in 
the creative problem solving course at the 
University of Buffalo. Group 2 was com- 
prised of 78 students enrolled in other un- 
dergraduate courses at the university. The 
data for the two groups were not combined 
because a cursory examination gave reason 
to suspect that the groups might differ. 
(Their averages on the experimental prob- 
lem were 7.9 and 6.7, respectively.) Accord- 
ingly, a statistical treatment was applied 
which allowed the use of all the data, but 
took into account this possible difference. 

Experimental problem. The Hanger problem used in the present
experiment was selected from Part V of the AC Test of Creative
Ability. This test is reported to have differentiated groups of
creative from noncreative subjects (Harris & Simberg, 1954). Data are
available from a previous experiment which indicated that scores on
the Hanger problem are positively correlated with other tests which
have been designed to measure creative thinking (Meadow & Parnes,
1959).³ Instructions for the Hanger problem required subjects to list
all possible uses for an ordinary wire coat hanger. Time limit for the
test was 5 minutes.

Ratings. The test was group administered 
during regular class periods. The scoring 
procedure was modified to yield a quantity 
and quality score instead of a quantity and 
uniqueness score, as reported in the previous 
experiment (Meadow & Parnes, 1959). Each 
response was scored, by a trained research 
assistant, as indicating either good or bad 
quality. The quality score was defined as 
comprising two dimensions: uniqueness— 
degree to which the response departed from 
the hanger’s conventional use and value— 
the degree to which the response was judged 
to have social, economic, esthetic, or other 
usefulness. 

The scorer was instructed to rate each re- 
sponse on a three-point scale for uniqueness 
and a three-point scale for value. The re- 
sponse was finally scored as indicating good 


Scores on the Hanger problem in the previous experiment correlated with other creative ability tests as follows: Guilford’s Unusual Uses, .473; Guilford’s Plot Titles High, .452; Guilford’s Apparatus, .301; TAT Originality, .520. All but the Apparatus correlation were significant at the .01 level. The correlation with the Apparatus test was significant at the .05 level.


SIDNEY J. PARNES 


quality if assigned a combined uniqueness and value score of at least 5. The score used was the total number of “good quality” responses. Any response which duplicated (in essential meaning) one already given was eliminated from the scoring.

The rater had no knowledge of the purpose of this study; therefore, there was no reason to suspect any bias in ratings of early ideas vs. later ideas. Interrater reliability for ratings of 50 subjects selected at random was .74.
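The scoring rule described under Ratings can be sketched in a few lines. This is only an illustration: the source says each scale had three points but does not give the numeric coding, so the 1 to 3 coding below is an assumption, and the `meaning_key` used to detect duplicated responses is a hypothetical stand-in for the rater's judgment of "essential meaning".

```python
def is_good_quality(uniqueness, value, threshold=5):
    """Apply the combined-score rule: good if uniqueness + value >= threshold.

    Assumption: both three-point scales are coded 1, 2, 3.
    """
    for rating in (uniqueness, value):
        if rating not in (1, 2, 3):
            raise ValueError("ratings are assumed to be coded 1, 2, or 3")
    return uniqueness + value >= threshold


def quality_score(rated_responses):
    """Count good-quality responses, skipping duplicates.

    rated_responses: list of (meaning_key, uniqueness, value) tuples in the
    order written. meaning_key is a hypothetical label the rater would assign
    so that responses with the same essential meaning share a key.
    """
    seen = set()
    good = 0
    for meaning_key, uniqueness, value in rated_responses:
        if meaning_key in seen:
            continue  # a response duplicating an earlier one is not scored
        seen.add(meaning_key)
        if is_good_quality(uniqueness, value):
            good += 1
    return good
```

With this coding, a response needs at least a 2 on one scale and a 3 on the other to count as good.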

Procedure. The subjects were divided into subgroups on the basis of their total quantities of ideas. Group 1 was divided at the median of 7.9, and Group 2 at the median of 6.7. The subjects with a total number of ideas less than the median were assigned to Subgroup L, and those with a total number of ideas greater than the median were assigned to Subgroup H. To obtain proportional cell frequencies for the analysis of variance, the data of 12 subjects, 6 from each group, were eliminated in a random manner with the restriction that their total number of ideas was near the median. There remained 62 Group 1 and 72 Group 2 subjects.

Each subject’s data were divided into halves on the basis of his total number of ideas. Those subjects with an odd number of ideas were treated as follows: If the middle idea was a good one, it was divided between the halves by adding one-half point to the score for each half.
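The halving rule can be written out as a short sketch. The function name and the boolean-list input are illustrative assumptions; the only rule taken from the text is that a good middle idea in an odd-length list adds one-half point to each half.

```python
def split_half_scores(is_good):
    """Return (first_half, second_half) counts of good ideas.

    is_good: list of booleans, one per idea, in the order the ideas were
    written. For an odd number of ideas, a good middle idea contributes
    one-half point to each half; a middle idea that is not good adds nothing.
    """
    n = len(is_good)
    half = n // 2
    first = float(sum(is_good[:half]))
    second = float(sum(is_good[n - half:]))
    if n % 2 == 1 and is_good[half]:
        first += 0.5
        second += 0.5
    return first, second
```

For example, a subject with five ideas whose second, third, and fifth ideas were rated good would receive 1.5 points in each half.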


Results and Discussion 


Table 1 presents the mean number of “good” ideas in each half for each group. There were more good ideas in the second half than in the first half, and more good ideas in Subgroup H than in Subgroup L, but there was little difference between Group 1 and Group 2. A Lindquist Type III analysis of variance of these data (Lindquist, 1956) indicated highly significant differences between Half I and Half II (F = 32.16; df = 1,130; p < .001) and between Subgroups L and H (F = 56.37; df = 1,130; p < .001). No other differences were statistically reliable.


The results of the experiment indicate that extended effort in producing ideas on a creative thinking problem tends to reward subjects with a greater

EXTENDED EFFORT IN CREATIVITY 


proportion of good ideas in the second half of their total output. As in a previous experiment (Parnes & Meadow, 1959), a significant relationship was found between quantity and quality of production. There was no significant difference between high and low producers in their proportions of good ideas in the second half of total production.

The results support Osborn’s theory that in idea production, quantity leads to quality. The results also appear to be in accordance with Gordon’s explanation of “deferment” in the creative process. He describes deferment as “the capacity to discard the glittering immediate in favor of a shadowy but possibly richer future.” The noncreative problem solver gets an idea, sees it as a possible solution to his problem, and settles for it without further ado. The creative problem solver is not satisfied with his first idea. Like the person who invests to obtain greater rewards, the creative person forgoes the immediate reward of applying his first idea, in expectation of a better solution (greater reward) ultimately. A further hypothesis suggested by Osborn’s and Gordon’s theories is that the best idea will come late in the total production period. The author is currently considering experiments to test this particular hypothesis.


Experiment II


Experiment II was designed to determine whether the type of results found in Experiment I with untrained subjects would also occur with subjects trained in the use of the brainstorming principle of deferred judgment. It was also decided to lengthen the time period to 15 minutes instead of 5, in order to see if a trend could be observed towards increasingly greater




TABLE 1
Mean Number of Good Ideas among First and Second Halves of All Ideas and among All Ideas

Group            First    Second    All Ideas
Group 1
  Low             .69      1.11       1.81
  High           1.61      2.48       4.10
  Mean           1.15      1.80       2.95
Group 2
  Low             .51      1.24       1.75
  High           1.47      2.42       3.89
  Mean            .99      1.83       2.82
Combined
  Low             .60      1.18       1.78
  High           1.54      2.45       3.99
All Subjects
  Mean           1.07      1.81       2.88

Note. Low refers to the group whose total number of ideas was less than the median number of ideas; High refers to the group whose total number of ideas was greater than the median number of ideas.
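As a rough consistency check on Table 1, the "All Subjects" means should be close to the size-weighted average of the two group means. The sketch below assumes the group sizes of 62 and 72 stated in the Procedure and treats the tabled means as rounded to two decimals; the function name is illustrative only.

```python
def pooled_mean(means, sizes):
    """Size-weighted average of group means."""
    return sum(m * n for m, n in zip(means, sizes)) / sum(sizes)


GROUP_SIZES = [62, 72]  # Group 1 and Group 2 after eliminating 12 subjects

# Group means from Table 1: (Group 1, Group 2) for each column.
pooled_first = pooled_mean([1.15, 0.99], GROUP_SIZES)
pooled_second = pooled_mean([1.80, 1.83], GROUP_SIZES)
pooled_all = pooled_mean([2.95, 2.82], GROUP_SIZES)

# Tabled "All Subjects" means for comparison.
TABLED = {"first": 1.07, "second": 1.81, "all": 2.88}

for label, value in (("first", pooled_first),
                     ("second", pooled_second),
                     ("all", pooled_all)):
    print(label, round(value, 3), "tabled:", TABLED[label])
```

The pooled values agree with the tabled ones to within about .01, which is what rounding of the group means would be expected to produce.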


proportions of good ideas as a subject’s 
total quantity increases. 


Method 


Subjects. Forty-two students in two creative problem solving classes at the University of Buffalo served as subjects. The experiment was conducted during the final half of the course.

Experimental problem. The Hanger problem was used as in Experiment I. However, the trained students were very familiar with the brainstorming principle. Therefore, they were urged to follow this principle of deferred judgment during their idea production. Time limit was 15 minutes.

Ratings. The same rating system was used as in Experiment I. However, before the new experiment was conducted, the rater had already been told the purpose of the original study. If he were given the subjects’ protocols, his ratings of the ideas might have been influenced this time by the relative position of each idea on the page. Accordingly, the rater was given each subject’s list of ideas typed onto new sheets in alphabetical order. Thus he did not know the position of an idea in a subject’s total list of ideas.

After the rater had scored the ideas on the typed sheets, clericals transferred these ratings back to the subjects’ original lists.
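The blind-rating step amounts to a reorder-and-restore operation, which can be sketched as follows. The function names are hypothetical, and case-insensitive alphabetical ordering is an assumption; the source says only that the ideas were typed in alphabetical order and the ratings later transferred back.

```python
def make_blind_sheet(ideas):
    """Return (typed_sheet, order) for one subject.

    typed_sheet lists the ideas alphabetically, hiding their original
    positions. order[k] is the original position of the k-th typed idea,
    kept aside so ratings can be transferred back afterwards.
    """
    order = sorted(range(len(ideas)), key=lambda i: ideas[i].lower())
    typed_sheet = [ideas[i] for i in order]
    return typed_sheet, order


def transfer_ratings_back(sheet_ratings, order):
    """Put ratings made on the typed sheet back into original idea order."""
    restored = [None] * len(order)
    for sheet_position, original_position in enumerate(order):
        restored[original_position] = sheet_ratings[sheet_position]
    return restored
```

The rater sees only `typed_sheet`; the `order` list plays the role of the clerical transfer back to the subject's original protocol.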
Procedure. The total list of ideas of each 
subject was then divided into thirds—a first 
third, a middle third, and a last third in a 
similar manner as was used to divide the 
total ideas into halves in Experiment I. 
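The text does not spell out how lists whose length is not a multiple of three were divided. One possible reading, offered here only as an assumption, generalizes the half-point rule of Experiment I: a good idea that straddles a boundary is credited to the adjoining parts in proportion to its overlap with each.

```python
from fractions import Fraction


def split_scores(is_good, parts=3):
    """Credit good ideas to equal-width parts of an idea list.

    Assumed rule (not confirmed by the source): idea i occupies the
    interval [i, i + 1) of a list of length n, each part spans n / parts
    positions, and a good idea's single point is shared among parts in
    proportion to overlap. With parts=2 this reproduces the half-point
    rule for a good middle idea.
    """
    n = len(is_good)
    width = Fraction(n, parts)
    scores = [Fraction(0)] * parts
    for i, good in enumerate(is_good):
        if not good:
            continue
        for j in range(parts):
            low, high = j * width, (j + 1) * width
            overlap = min(high, i + 1) - max(low, i)
            if overlap > 0:
                scores[j] += overlap
    return [float(s) for s in scores]
```

Under this reading the part scores always sum to the subject's total number of good ideas, so no idea is counted twice or dropped.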


Results and Discussion 


The mean numbers of good ideas 
per third were 4.84, 5.09, and 5.81 for 
the first, middle, and last thirds, re- 
spectively. The analysis of variance of 
these data indicated that the main ef- 
fect of thirds was significant (F = 
4.12; df = 2,84; p < .05). t tests 
showed that the mean for the last 
third was significantly greater than 
the means for both the first third (t = 
2.75; df = 84; p < .01) and the mid- 
dle third (t = 2.04; df = 84; p < .05). 
The means for the first and middle 
thirds did not differ significantly (t < 
1.00). Thus the findings of Experi- 
ment I are supported, and a trend is 
suggested towards increasingly larger 
proportions of good ideas with in- 
creased quantity. The results seem to 
provide further interest in the hypoth- 
esis that the best idea will come late 
in the total production period. 
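The three t ratios above can be checked against one another. The sketch below assumes that a single pooled standard error of a difference (from the analysis-of-variance error term with 84 df) was used for all three comparisons; the source does not state this, so it is an assumption being tested, and the function names are illustrative.

```python
# Mean numbers of good ideas per third, as reported for Experiment II.
MEANS = {"first": 4.84, "middle": 5.09, "last": 5.81}


def implied_standard_error(mean_a, mean_b, t_value):
    """Back out the standard error of a difference from a reported t ratio."""
    return (mean_a - mean_b) / t_value


def t_ratio(mean_a, mean_b, standard_error):
    """t ratio for a difference between two means."""
    return (mean_a - mean_b) / standard_error


# Use the reported last-vs-first t of 2.75 to recover the standard error,
# then recompute the other two comparisons from it.
se = implied_standard_error(MEANS["last"], MEANS["first"], 2.75)
t_last_vs_middle = t_ratio(MEANS["last"], MEANS["middle"], se)
t_middle_vs_first = t_ratio(MEANS["middle"], MEANS["first"], se)

print(round(se, 3), round(t_last_vs_middle, 2), round(t_middle_vs_first, 2))
```

Under that assumption the recomputed last-versus-middle ratio matches the reported 2.04 to two decimals, and the middle-versus-first ratio falls below 1.00, consistent with the text.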

The data of the first 5 minutes only* were examined on the same basis as Experiment I. For these first 5 minutes, a mean of 3.02 good ideas was produced in the first half of total ideas and a mean of 3.26 in the second half. Analysis of variance of these data indicated that these means were not significantly different. It is interesting to note that these means are both almost double to triple the means for the untrained subjects reported in the results section of Experiment I. This supports the evidence of the earlier experiments which demonstrated the effectiveness of the brainstorming principle and of training in its use (Meadow et al., 1959; Parnes & Meadow, 1959).

* Subjects had been requested at periodic time intervals during Experiment II to draw a line under their last idea.


Discussion 


Examination of the protocols suggested that the untrained subjects in Experiment I seem to start their production with more of the common, unoriginal ideas, high in usefulness but low in uniqueness. Toward the end of a 5-minute period, however, the untrained subjects may find themselves exhausted of these ideas, and may begin groping for less obvious ones. This may lead to their greater production, among the later ideas, of ones high in both usefulness and uniqueness, the criteria for the “good” ideas. This seems to be consistent with the findings of Christensen et al. (1957), where uncommonness and remoteness of responses increased with succeeding responses. In the protocols of the trained subjects of Experiment II, there seem to be many of the unique and useful ideas among the first ones; but within a 15-minute period these trained subjects may also tend to exhaust themselves of these ideas and find that they too have to “stretch” their imaginations more fully. This may explain why their strongest production of good ideas in the 15-minute period does not appear until the last third of their ideas.

In order further to explore the difference in results between trained and untrained subjects, a sign test analysis was made of scored data on hand for 180 untrained subjects of a previous experiment (Meadow & Parnes, 1959) for an entirely different type of problem (Guilford’s Plot Titles) with a 3-minute test period. These data indicated no significant difference in number of quality ideas in first half (mean of .57) and second half (mean of .62) of the idea lists.

Thus we have data suggesting: (a) for untrained subjects, significantly more good ideas among later ideas in a 5-minute period, but not significantly more in a 3-minute stint; (b) for trained subjects, significantly more good ideas among later ideas for a 15-minute stint, but not significantly more for a 5-minute stint. Theorizing from these four sets of related data, it might be hypothesized that subjects will get at least as many good ideas in the second half of their total idea output regarding a creative thinking problem as in the first half, but that with increasing production time, the proportion of good ideas in the second half will increase over the first half. It might be further theorized that untrained subjects will show this increasing proportion more rapidly than trained subjects, who have already learned to be more creative in their early ideas. In a sense, both trained and untrained subjects seem to be stimulating their creativity by extended effort in idea-finding. For the trained subjects, a 5-minute period (on the type of problem used) does not seem to be long enough to elicit extended effort on their parts. (This might be viewed analogously to the accomplished musician practicing on a beginner’s piece.) However, for untrained subjects, 5 minutes did seem to be sufficient time to call for extended effort. Much further experimentation is needed with both trained and untrained subjects, given a variety of time limits and types of problems, in order to verify the theory suggested.


Summary


Experiment I was designed to test the hypothesis that more good ideas will appear in the last half of a subject’s total idea output regarding a problem than during the first half. Each subject was given a creative thinking problem which involved production of ideas for a 5-minute period. The resultant ideas were evaluated by a trained rater. Number of good ideas was tallied for the first half of each subject’s total idea list, and for the second half of his list.

The findings demonstrated signifi- 
cantly more good ideas to appear in 
the last half than in the first half of 
the idea lists. A significant relationship 
was also found between total quantity 
and total quality scores. 

Experiment II was designed to de- 
termine whether the type of results 
found in Experiment I with untrained 
subjects would also occur with sub- 
jects trained in the use of the brain- 
storming principle of deferred judg- 
ment. It was also decided to lengthen 
the time period to 15 minutes instead 
of 5, in order to see if a trend could be 
observed towards increasingly greater 
proportions of good ideas as a subject’s 
total quantity increases. Each subject 
was given 15 minutes to produce ideas 
regarding the same creative thinking 
problem as in Experiment I. Resultant 
ideas were evaluated as in the first ex- 
periment. Number of good ideas was 
tallied for each third of each subject’s 
total idea list—first third, middle 
third, and last third. 

The findings showed significantly more good ideas to appear in the final third of the subjects’ idea lists. Although there was not a significant difference between first and second “thirds,” a trend is suggested towards increasingly larger proportions of good ideas with increased quantity.

The results are interpreted to indicate that extended effort in producing ideas on a creative thinking problem tends to reward problem solvers with a greater proportion of good ideas among the later ideas on their lists.

REFERENCES 


Christensen, P. R., Guilford, J. P., & Wilson, R. C. Relations of creative responses to working time and instructions. J. exp. Psychol., 1957, 53, 82-88.

Gordon, W. J. J. Operational approach to creativity. Harv. bus. Rev., 1956, Nov-Dec., 41-51.

Harris, R. R., & Simberg, A. L. AC Test of Creative Ability: Examiners manual. Detroit: General Motors Corp., AC Spark Plug Division, 1954.

Lindquist, E. F. Design and analysis of experiments in psychology and education. Boston: Houghton Mifflin, 1956.

Meadow, A., & Parnes, S. J. Evaluation of training in creative problem solving. J. appl. Psychol., 1959, 43, 189-194.

Meadow, A., Parnes, S. J., & Reese, H. Influence of instructions and problem sequence on a creative problem solving test. J. appl. Psychol., 1959, 43, 413-416.

Osborn, A. F. Applied imagination. New York: Scribner, 1957.

Parnes, S. J. (Ed.) Compendium of research on creative imagination. Buffalo, N.Y.: Creative Education Foundation, 1958.

Parnes, S. J. Instructor’s manual for semester courses in creative problem-solving. Buffalo, N.Y.: Creative Education Foundation, 1959.

Parnes, S. J. (Ed.) Compendium No. 2 of research on creative imagination. Buffalo, N.Y.: Creative Education Foundation, 1960.

Parnes, S. J., & Meadow, A. Effects of “brainstorming” instructions on creative problem solving by trained and untrained subjects. J. educ. Psychol., 1959, 50, 171-176.

Parnes, S. J., & Meadow, A. Evaluation of persistence of effects produced by a creative problem-solving course. Psychol. Rep., 1960, 7, 357-361.

Taylor, C. W. (Ed.) The third University of Utah research conference on the identification of creative scientific talent. Salt Lake City, Utah: Univer. Utah Press, 1959.

Taylor, D. W., Berry, P. C., & Block, C. H. Does group participation when using brainstorming facilitate or inhibit creative thinking? Admin. sci. Quart., 1958, 3, 23-47.


(Received October 3, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 3


PATTERNS OF SELF-CONCEPTUALIZATION IN HIGH SCHOOL AND COLLEGE STUDENTS


WILBUR C. WASHBURN 


University of California 


Self-concepts have been related to child development in the theoretical orientations of various psychologists, including Erikson, Sarbin, Freud, Horney, and Fromm. Although the term “self-concept” has not always been used, the “character traits” described by these writers correspond to what is now frequently called a self-concept. For purposes of this study, a self-concept is regarded as an enduring aspect of personality: a motive, attitude, or value by means of which the individual relates himself to his social environment. The term applies to the person’s sets of attitudes toward himself in relation to others.


PROCEDURE


The aim was to find patterns of self-concepts useful in describing adolescents and young adults and to integrate the various theories that formulate self-concepts in terms of child development. Since many existing sets of postulated self-concepts were originally inferred on the basis of very early developmental periods, they may not be adequately descriptive of more mature levels of personality. The procedure, then, was to translate the hypothetical patterns of self-concepts offered by a variety of theorists into test items, to administer the test to groups of adolescent and young adult subjects, and to revise the list of hypothetical factors of self-conceptualization on the basis of intercorrelation of test items. The resulting sets of self-concepts were then ranked on the basis of maturity ratings provided by groups of graduate students specializing in developmental psychology.

Since the results of any statistical 
analysis of test items depend in part 
on the nature of the items selected for 
analysis, an effort to minimize this 
limitation was made by including 
items based on a variety of theoretical 
viewpoints. Table 1 indicates the pos- 
tulated patterns of self-concepts for 
which items were devised and lists a 
sample item for each category. Items 
were based on the following theoretical 
formulations: (a) the empirical selves 
hypothesized by Sarbin (1952), (b) 
the psycho-social stages stated by 
Erikson (1960), (c) the character 
types described by Freud (1933), (d) 
the personality patterns postulated by 
Fromm (1947), and (e) the interper- 
sonal attitudes developed by Horney 
(1945). 
Construction of the Testing Instrument

The test originated with a cluster analysis, using the technique described by Tryon (1939), applied to Sarbin’s five empirical selves. The author constructed 135 items representing self-concepts postulated by Sarbin. Five graduate students specializing in personality theory were asked to classify each item into the category it was supposed to measure. All five judges correctly classified 110 items, which then were administered to a group of 100 high school students.

Four patterns of self-concepts emerged as a result of the cluster analysis. Sarbin’s first three empirical selves were highly intercorrelated and



TABLE 1
The Postulated Patterns of Self-Concepts on Which Test Items Were Based

Personality Theorist | Hypothesized Self-Concept | Example of Itemᵃ

Erikson | Infancy (Trust vs. Mistrust) | Most persons can be trusted.
        | Early Childhood (Autonomy vs. Shame and Doubt) | I prefer to do things for myself.
        | Play Age (Initiative vs. Guilt) | I take the lead in getting friends together.
        | School Age (Industry vs. Inferiority) | I really enjoy working.
        | Adolescence (Identity vs. Identity Diffusion) | I usually stick to my own ideas even when friends disagree.
        | Young Adulthood (Intimacy vs. Isolation) | When feeling blue I try to find someone who is cheerful.
        | Adulthood (Generativity vs. Self-Absorption) | I wish I could discover something that would benefit future generations.
        | Senescence (Integrity vs. Disgust) | I try to find my faults and correct them.

Freud   | Oral Character | I look on the bright side of things.
        | Anal Character | I tend to save my money.
        | Genital Character | I love humanity in general.

Fromm   | Receptive Character | I prefer letting others make plans.
        | Exploitative Character | I want to win when pitted against others.
        | Hoarding Character | If buying a car, I would be most interested in its economy.
        | Marketing Character | It is important to always please others.
        | Productive Character | Friends say I am creative and original.

Horney  | Compliant (Moving Toward) | I enjoy just being with people.
        | Detached (Moving Away) | I prefer being alone.
        | Aggressive (Moving Against) | I often object to another’s ideas.

Sarbin  | Somatic Self | My body has greatly influenced me.
        | Receptor-Effector Self | I can’t wait to eat when hungry.
        | Primitive-Construed Self | I let off steam when angry.
        | Introjecting-Extrojecting Self | I usually do the expected thing to avoid disapproval of others.
        | Social Self | If I cheated on a test, I would feel so [illegible] I couldn’t stand it.

ᵃ Items were selected randomly for illustration.


were labeled the Somatic-Primitive 
Self. Items from Sarbin’s fifth empiri- 
cal self had a high negative correlation 
with the first cluster and were called 
the Inner-Controlled Self. Sarbin’s 
fourth empirical self split into two 
sets of items. One cluster was labeled 
the Submissive-Dependent Self, while 
the other was tentatively named the 
Socially Integrated Self. These labels 
were assigned each cluster by con- 
sidering apparent logical relationships 
among items.

Items representing a wider variety of theories were then added. The author constructed 250 items based on the theories of Erikson, Freud, Fromm, and Horney. The judges were again asked to classify each new item into the category or categories it presumably measured, and the 225 items on which there was 100% agreement were administered to a second group of 100 high school students. A biserial coefficient of correlation was computed to show the relationship of each item to the total score of the four existing clusters. Items were assigned to the cluster with which they had the highest correlation.
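The assignment rule (each new item goes to the cluster whose total score it correlates with most highly) can be sketched as below. Note the substitution: the source reports a biserial coefficient, whereas this sketch uses the simpler point-biserial coefficient as a stand-in, and the function names and data layout are illustrative assumptions.

```python
from math import sqrt


def point_biserial(item, totals):
    """Point-biserial correlation between a 0/1 item and total scores.

    item: list of 0/1 responses, one per student.
    totals: list of cluster total scores for the same students.
    Returns 0.0 when the correlation is undefined (no variation).
    """
    n = len(item)
    ones = [t for x, t in zip(item, totals) if x == 1]
    zeros = [t for x, t in zip(item, totals) if x == 0]
    if not ones or not zeros:
        return 0.0
    mean_all = sum(totals) / n
    sd = sqrt(sum((t - mean_all) ** 2 for t in totals) / n)
    if sd == 0:
        return 0.0
    p = len(ones) / n
    mean_diff = sum(ones) / len(ones) - sum(zeros) / len(zeros)
    return mean_diff / sd * sqrt(p * (1 - p))


def assign_to_cluster(item, cluster_totals):
    """Return the name of the cluster with the highest item-total correlation.

    cluster_totals: dict mapping cluster name to a list of total scores.
    """
    return max(cluster_totals,
               key=lambda name: point_biserial(item, cluster_totals[name]))
```

Taking the highest signed correlation, rather than the largest absolute value, follows the wording of the text; items with a strong negative correlation would be handled separately, as the later formation of contrasting clusters suggests.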




All items representing Horney’s compliant orientation and Fromm’s receptive and marketing characters were added to the test measuring the Submissive-Dependent Self. A large number of items were added to the test measuring the Inner-Controlled Self from those measuring Freud’s anal character, Fromm’s hoarding type, and the problems described by Erikson under school age, adolescence, adulthood, and senescence. The cluster labeled Socially Integrated Self absorbed many items measuring Freud’s oral and genital characters, Fromm’s productive character, and the problems stated by Erikson under adulthood and senescence. The nature of the items added to this cluster suggested a similarity to the self-actualizing personality described by Maslow (1954), and the cluster was renamed the Integrative-Actualizing Self. A set of items was found to have a high negative correlation with the Submissive-Dependent Self and was labeled the Outer-Controlling Self. This new pattern of self-concepts was composed largely of items representing Horney’s aggressive orientation, although it also contained items measuring the problems described by Erikson as typical of early childhood, the play age, and the school age. Another group of items had a high negative correlation with the Integrative-Actualizing Self and was named the Detached-Independent Self. It was composed of items measuring Horney’s detached orientation and the problems outlined by Erikson with regard to the periods of infancy, early childhood, and young adulthood.

Table 2 indicates relatively high negative correlations between certain of the six resulting patterns of self-concepts. Each pair of clusters with high negative correlations described contrasting self-concepts. Hence forced-choice items were developed in which items from negatively correlated clusters were combined as alternative choices in a single question. Subtest 1 measured the Somatic-Primitive Self and the Inner-Controlled Self. Subtest 2 measured the Submissive-Dependent Self and the Outer-Controlling Self. Subtest 3 measured the Detached-Independent Self and the Integrative-Actualizing Self. Sample forced-choice items from each subtest are presented in Table 3.

TABLE 2
Intercorrelations among Self-Patterns

                             Submissive-   Detached-     Outer-        Inner-        Integrative-
                             Dependent     Independent   Controlling   Controlled    Actualizing
Somatic-Primitive Self          .17           .10           .21          -.88           -.12
Submissive-Dependent Self                     .10          -.86           .18           -.08
Detached-Independent Self                                   .09        [illegible]   [illegible]
Outer-Controlling Self                                                   -.19            .11
Inner-Controlled Self                                                                    .13

The use of forced-choice items offered several advantages. It resulted in a 50% reduction in the number of answers to be scored on each answer sheet. Once a score was obtained for a given cluster, the score for its negatively correlated counterpart could be computed simply by subtracting the first score from the maximum attainable score. Anastasi (1954) points out that this testing technique provides changing alternatives and thus avoids



TABLE 3
Sample Items from the Three Subtests

Subtest | Examples of Test Itemsᵃ

Subtest 1: (A) Inner-Controlled Self; (B) Somatic-Primitive Self

  (A) If I won $1,000, I would be more likely to save most of it for future needs.
  (B) If I won $1,000, I would be [illegible] most of [illegible] I need [illegible].

  (A) I would prefer having parents, teachers, or employers think well of me.
  (B) I would prefer just taking it easy and enjoying life.

  (A) [illegible] have had a greater influence on me.
  (B) My body and its feelings have had a greater influence on me.

  (A) I say little when angry.
  (B) I let off steam when angry.

  (A) I am sure most policemen are helpful.
  (B) It is likely too many policemen try to [illegible] their authority.

Subtest 2: (A) Outer-Controlling Self; (B) Submissive-Dependent Self

  (A) I like giving directions to others.
  (B) I prefer following directions given by others.

  (A) I want to be considered strong by others.
  (B) I prefer being thought of as kind and sympathetic.

  (A) I would like to be thought of as an important person in my community.
  (B) I wouldn’t care to be thought of as an important person.

  (A) I often raise objections to another’s ideas.
  (B) I usually accept the ideas of others without being critical.

  (A) My former teachers would be more likely to call me a leader.
  (B) My former teachers would be more likely to call me a follower.

Subtest 3: (A) Integrative-Actualizing Self; (B) Detached-Independent Self

  (A) I often enjoy change and rather look forward to trying new things and meeting new people.
  (B) I dislike change and tend to feel uncomfortable in new situations.

  (A) One of my best features is that I like people.
  (B) One of my best features is that I don’t let feelings influence my judgment.

  (A) I realize human nature isn’t perfect, but I can accept it and even like it as it is.
  (B) I find it hard to accept human nature as it is.

  (A) I am left with a feeling of strength upon seeing or hearing something truly beautiful.
  (B) I am not too impressed by art or music, although these things seem beautiful to others.

  (A) I am usually optimistic and enthusiastic and seldom lonely and depressed.
  (B) There are quite a few times when I feel alone and afraid, and just about the whole world seems unfriendly.

ᵃ Items were selected randomly for illustration.




the tendency toward a response set 
characteristic of constant-alternative 
true-false items. 
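The scoring shortcut for forced-choice items (score one cluster, then subtract from the maximum to get its counterpart) can be sketched as follows. It assumes one point per item and that every item is answered; both are assumptions, as the source does not describe how omitted items were handled.

```python
def subtest_scores(responses):
    """Score one forced-choice subtest.

    responses: list of "A" or "B", one per item. Only the A choices are
    counted; the B score is the maximum attainable score minus the A score.
    """
    for r in responses:
        if r not in ("A", "B"):
            raise ValueError("each response is assumed to be 'A' or 'B'")
    maximum = len(responses)
    score_a = sum(1 for r in responses if r == "A")
    return {"A": score_a, "B": maximum - score_a}
```

Because the two scores always sum to the number of items, they are perfectly negatively related within a subtest, which is why only one of them needs to be tallied.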
Forced-choice alternatives were selected from items which seemed to logically fit together. However, items chosen by over 90% or fewer than 10% of the total group were eliminated, since they seemed to reflect extremes of social approval and disapproval. New forced-choice items, 50 for each subtest, were added to existing items and the revised test administered to a third group of 100 high school students. A biserial coefficient of correlation was computed showing the relationship of each item to the total subtest score. Items correlating less than .20 with the total subtest were eliminated. The new forced-choice form was then administered to a group of 300 undergraduate students at the University of California, each item correlated with the total score, and additional items eliminated.
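The two elimination criteria can be combined into a single filter, sketched below. The 90% ceiling and the .20 correlation floor come from the text; the lower endorsement cutoff is hard to read in the source and is set to 10% here as an assumption by symmetry. The function names and the item-record layout are hypothetical.

```python
def keep_item(endorsement_rate, item_total_r,
              low=0.10, high=0.90, min_r=0.20):
    """Decide whether a forced-choice item survives item analysis.

    endorsement_rate: proportion of the group choosing the alternative.
    item_total_r: correlation of the item with the total subtest score.
    low is an assumed cutoff; high and min_r follow the text.
    """
    within_range = low <= endorsement_rate <= high
    return within_range and item_total_r >= min_r


def filter_items(items):
    """Keep the names of items passing both criteria.

    items: dict mapping item name to (endorsement_rate, item_total_r).
    """
    return [name for name, (rate, r) in items.items() if keep_item(rate, r)]
```

Passing the cutoffs as parameters makes it easy to rerun the filter if the lower bound is later confirmed to be a different figure.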
Validity of the subtests was appraised by asking high school counselors to rate students for each subtest on the basis of an adjective checklist describing characteristics appropriate for the subtest. For example, descriptive terms applicable to the first subtest included the following words: conscientious, controlled, conforming. Subjects with ratings in the upper and lower 27% of the group based on total number of adjectives checked for each subtest were then compared for mean scores on the subtest and t ratios computed. The differences between means were all significant at less than the .01 level of confidence. Similar results were obtained in a second validation study. Corrected split-half reliability coefficients were above .80 for groups from the tenth grade to college undergraduates.¹

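Two steps in the validation paragraph lend themselves to a short sketch: picking the upper and lower 27% groups, and correcting a split-half correlation. The source does not name the correction used; the Spearman-Brown formula below is the conventional one and is assumed here. Function names and the rounding of the group size are likewise illustrative.

```python
def extreme_groups(adjective_counts, fraction=0.27):
    """Return (lower_group, upper_group) of subject identifiers.

    adjective_counts: dict mapping subject to the number of adjectives a
    counselor checked for that subject on one subtest's checklist.
    Group size is fraction * N rounded to the nearest whole subject
    (an assumption; ties in the counts are broken arbitrarily).
    """
    ranked = sorted(adjective_counts, key=adjective_counts.get)
    k = max(1, round(len(ranked) * fraction))
    return ranked[:k], ranked[-k:]


def spearman_brown(r_half):
    """Estimate full-length reliability from a half-test correlation."""
    return 2 * r_half / (1 + r_half)
```

Under this correction, a correlation between half-tests of about .67 would already yield a corrected coefficient of .80.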

¹ A copy of the testing instrument has been deposited with the American Documentation Institute. Order Document No. 6612 from ADI Auxiliary Publications Project, Photoduplication Service, Library of Congress, remitting in advance for photocopies. Make checks payable to: Chief, Photoduplication Service, Library of Congress.

Five graduate students at the University of California were asked to rank each cluster in terms of relative maturity as inferred on the basis of their knowledge of developmental psychology. Each judge had completed at least two advanced undergraduate courses and graduate research in this area. A second group of five graduate students with similar qualifications but from a different institution also ranked the six patterns of self-concepts. While both groups agreed on the rank order, agreement was less consistent for the relative position of the Detached-Independent Self and the Outer-Controlling Self than for the other levels of self-conceptualization. There was 100% agreement on the Somatic-Primitive Self; 90% each on the Submissive-Dependent Self, the Inner-Controlled Self, and the Integrative-Actualizing Self; 70% on the Detached-Independent Self; and 60% on the Outer-Controlling Self.

The six resulting self-patterns, listed from the one ranked least mature to the one ranked most mature, may be described as follows: (a) the Somatic-Primitive Self involves inability to postpone need satisfaction and failure to identify with authority figures, (b) the Submissive-Dependent Self involves avoidance of disapproval by submitting to others, (c) the Detached-Independent Self involves avoidance of interpersonal relations and situations likely to arouse emotion, (d) the Outer-Controlling Self involves achievement of social status through manipulation of the external world in socially approved ways, (e) the Inner-Controlled Self involves development of inner controls through internalization of external social norms, (f) the Integrative-Actualizing Self involves acceptance of oneself and others and a tendency toward increased creativity and productivity.


RESULTS 


A Comparison of High School and 
College Groups 


The final form of the test was uti- 
lized with a sample of high school and 
college students. The high school stu- 
dents were from a large urban high 
school in California and were ran- 
domly selected within categories to 
constitute a stratified sample propor- 
tionate to the college group regarding 
racial and socioeconomic classifica- 
tion. The college students consisted of 
undergraduates attending the Univer- 
sity of California at Berkeley, and 
the sample was selected to be representative
of this campus with regard to fields of study.
Separate results are reported for males and females
in these groups and for college preparatory and
noncollege preparatory high school students. The
college preparatory group consisted of students
taking courses which will enable them to attend a
private or state college, a private or state
university, or [illegible] college. The college
sample consisted of a group selected from a sample
similar to the college preparatory group on the
basis of academic achievement in high school.
Further details about all these groups are
presented in Table 4.

Means for high school and college groups are
indicated in Table 5. Separate means are not given
for each age group within the high school or
college groups, since no statistically significant
differences were found with regard to such age
levels. Since each subtest measures two negatively
correlated self-levels, one rated as relatively
more mature than the other, the means presented in
Table 5 are for the self-pattern with the higher
maturity ranking. For example,
TABLE 4
COMPOSITION OF GROUPS USED IN THE STUDY

                                                    High School:        High School:
Characteristics                  College Sample     Noncollege          College
                                                    Preparatory         Preparatory

Males                            66                 [illegible]         100
Females                          49                 [illegible]         [illegible]
Median Age (Males)               22 yrs.-1 mo.      16 yrs.-11 mo.      16 yrs.-10 mo.
Median Age (Females)             21 yrs.-10 mo.     16 yrs.-11 mo.      16 yrs.-9 mo.
Caucasian                        99 (86%)           73 (86%)            202 (86%)
[one row illegible]
Other                            7 (6%)             5 (6%)              14 (8%)
Upper Socioeconomic Status(a)    32 (28%)           [illegible] (28%)   [illegible] (28%)
Middle Socioeconomic Status(a)   60 (52%)           [illegible] (52%)   [illegible] (52%)
Lower Socioeconomic Status(a)    23 (20%)           17 (20%)            47 (20%)
First Year                       [illegible]        [illegible]         [illegible]
Second Year                      [illegible]        0                   0
Third Year                       20                 30                  [illegible]
Fourth Year                      19                 [illegible]         148

(a) Based on father's occupation.




TABLE 5
A COMPARISON OF HIGH SCHOOL AND COLLEGE GROUPS

                                                 Subtest 1      Subtest 2      Subtest 3
Group or comparison                        N     M      SD      M      SD      M      SD

Males
  High School, Noncollege                 40    70.1   16.3    50.4   11.6    70.0   15.1
  High School, College Preparatory       100    78.2   25.8    58.4   13.9    74.2   20.7
  College                                 66    79.3   16.2    66.7   11.9    76.3   16.6
  t (High School, Noncollege vs.
     High School, College Preparatory)          1.81           3.18**         1.15
  t (High School, College Preparatory
     vs. College)                                .25           3.44**          .57

Females
  High School, Noncollege                 45    71.5   20.4    41.0   15.1    77.5   14.3
  High School, College Preparatory       135    79.0   19.1    50.1   14.5    86.6   16.0
  College                                 49    78.0   16.7    55.9   14.4    71.5   19.2
  t (High School, Noncollege vs.
     High School, College Preparatory)          2.46           3.65**         2.27
  t (High School, College Preparatory
     vs. College)                                .62           2.60           4.20**

Males-Females Compared
  t (High School, Noncollege Females
     vs. High School, Noncollege Males)          .33           3.06**         2.35
  t (High School, College Preparatory
     Females vs. High School, College
     Preparatory Males)                          .58           4.39**         3.01**
  t (College Females vs. College Males)          .31           4.38**         1.42

** Significant at the .01 level of confidence or less.


means given under Subtest 1 are for
the Inner-Controlled Self, and the
means for the Somatic-Primitive Self
can be determined by subtracting the
stated means from the maximum possible
score of 124 points. Means presented
under Subtest 2 apply to the
Outer-Controlling Self, and means for
the Submissive-Dependent Self can be
determined by subtracting the stated
means from 103, the highest attainable
score for this subtest. Means listed for
Subtest 3 represent the Integrative-Actualizing
Self, and means for the
Detached-Independent Self may be
computed by subtracting the stated
means from the maximum possible
score of 120 points.
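
The subtraction rule described above can be sketched in a few lines of Python. This is only an illustration: the function name and the dictionary layout are assumptions introduced here, while the maximum scores (124, 103, 120) and the pairing of self-levels come from the text.

```python
# Maximum possible score on each subtest, as stated in the text.
MAX_SCORE = {1: 124, 2: 103, 3: 120}

# (self-level whose mean is tabled, complementary self-level) per subtest.
PAIRS = {
    1: ("Inner-Controlled", "Somatic-Primitive"),
    2: ("Outer-Controlling", "Submissive-Dependent"),
    3: ("Integrative-Actualizing", "Detached-Independent"),
}


def complementary_mean(subtest: int, reported_mean: float) -> float:
    """Return the mean for the less mature self-level of a subtest.

    The reported mean is subtracted from the subtest's maximum score
    and rounded to one decimal place, matching the tabled precision.
    """
    return round(MAX_SCORE[subtest] - reported_mean, 1)


if __name__ == "__main__":
    # Mean of 70.1 on Subtest 1 (high school noncollege males in Table 5).
    level = PAIRS[1][1]
    print(level, complementary_mean(1, 70.1))
```

Because each subtest scores two negatively correlated levels on one scale, a single tabled mean is enough to recover both.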

Although high school boys preparing 
for college tended to score higher than 
those not preparing for college on all 
three self-levels rated most mature, 
the difference was significant at less 
than the .01 level only for the Outer- 
Controlling Self. Although college 
males scored higher than high school 
college preparatory males on all three 
mature levels of self-conceptualiza- 
tion, differences were slight and not 
significant at an acceptable level except
for the Outer-Controlling Self,
which involved a difference significant 
at less than the .01 level. 
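
The group comparisons reported here rest on t ratios computed from means, standard deviations, and group sizes. The source does not state which form of the t test was used, so the sketch below assumes the conventional pooled-variance (Student) formula for two independent groups; the function name is hypothetical, and values obtained this way need not match Table 5 exactly.

```python
import math


def pooled_t(m1: float, sd1: float, n1: int,
             m2: float, sd2: float, n2: int) -> float:
    """Two-sample t ratio from summary statistics (pooled variance).

    Assumes independent groups and sample SDs computed with n - 1
    in the denominator; both are assumptions, not facts from the source.
    """
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / standard_error


if __name__ == "__main__":
    # Illustrative values only, not taken from the study.
    print(round(pooled_t(12.0, 2.0, 10, 10.0, 2.0, 10), 3))
```

The ratio is then referred to a t table with n1 + n2 - 2 degrees of freedom to judge significance.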

High school girls preparing for col- 
lege scored higher than noncollege pre- 
paratory females on the three self- 
levels with highest maturity rankings. 
Differences were significant at less 
than the .02 level for the Inner-Con- 
trolled Self, at less than the .01 level 
for the Outer-Controlling Self, and at 
less than the .05 level for the Inte- 
grative-Actualizing Self. Means repre- 
senting the Outer-Controlling Self 
were higher for college women than 
college preparatory high school girls, 
and the difference was significant at 
about the .01 level. However, female 
college students obtained a lower mean 
than college preparatory high school 
girls on the test measuring the Inner- 
Controlled Self, although the differ- 
ence was not statistically significant 
at an acceptable level. The college 
females also had a lower mean than 
college preparatory females on the 
test measuring the Integrative-Actual- 
izing Self, and the difference was large 
and significant at less than the .01 
level. 

Both groups of high school females 
scored higher than corresponding 
groups of males on the test measuring 
the Inner-Controlled Self, while males 
scored slightly higher than females 
on this test at the college level. How- 
ever, these differences were neither 
large nor statistically significant. All
male groups scored higher than cor- 
responding female groups on the test 
measuring the Outer-Controlling Self 
and all differences were relatively 
large and significant at less than the 
.01 level. High school girls had higher 
means than high school boys for the 
Integrative-Actualizing Self. The dif- 
ference between means on this test 
was significant at almost the .02 level 
for noncollege preparatory students
and at less than the .01 level for college
preparatory students. The trend
was reversed at the college level, as
college men tended to score higher
than college women on this test, although
the difference was not statistically
significant.


Discussion 


It is to be expected that college students
would tend to score higher than
high school students on tests measuring
mature self-levels as a result both
of college experience and selection of
more mature students for entrance to
the university. This hypothesis seems
to be confirmed for males with respect
to all three more mature self-levels,
although differences are large and significant
at less than the .01 level only
for the Outer-Controlling Self. Further,
females do not seem to follow
this trend in relation to the Inner-Controlled
Self and the Integrative-Actualizing
Self. However, these relatively
low scores of University of
California females indicate the [illegible]
group is more highly select than the
college males used in this study. Statistics
for the high school district
show that about 1.5 times more females
than males from this area attend
private colleges rather than [illegible]
public universities. The emphasis on
academic achievement as an entrance
requirement at the university possibly
results in selection of more competitive
individuals and would account in part
for the relatively high scores of both
male and female college groups on the
test measuring the Outer-Controlling
Self. The tendency for males consistently
to score higher than females on
the test measuring the Outer-Controlling
Self is to be expected in view of
child training practices in our society
which from early childhood permit
males greater freedom than females in
exploring and manipulating their environment.
Since females mature both
physically and socially at an earlier
age than males, it is not surprising to
find that high school females tend to
score higher than males on the two
self-patterns with the highest maturity
rankings, while the trend is reversed
and males score higher than females
at the university level.
The results for high school groups
indicate that college preparatory students
tend to have more mature self-concepts
than noncollege preparatory
students. The large and statistically
significant differences obtained between
college preparatory and noncollege
preparatory students on the
test measuring the Outer-Controlling
Self suggest that college preparatory
students show greater drive to attain
status long before attending college.

If further research confirms differences
in self-concepts related to
college attendance and preparation for
college, it becomes important to consider
[illegible] factors along with
[illegible] as a basis for
predicting [illegible] success among college
[illegible]. Since the results are likely to vary
with the theoretical viewpoints represented
in the test items analyzed, the
self-patterns described in this study
should not be regarded as a final list.
Further, the six self-levels should not
be regarded as rigid categories into
which all persons may be classified on
an all-or-none basis. Every subject
[illegible] at least some items for every
one of the six self-patterns. Hence it
seems safe to assume that the more
immature levels continue into at least
early adulthood to some extent in all
individuals.
Summary 

Items measuring self-concepts were 
constructed representing the theories 
of Sarbin, Freud, Erikson, Horney, 
and Fromm. A cluster analysis yielded 
six self-patterns, which, ranked from 
least to most mature on the basis of 
ratings by graduate students in 
psychology, were: (a) the Somatic-Primitive
Self, (b) the Submissive-Dependent
Self, (c) the Detached-Independent
Self, (d) the
Outer-Controlling Self, (e) the Inner-Controlled
Self, and (f) the Integrative-Actualizing
Self. Significant
differences in means on tests measur- 
ing various self-levels were found for 
contrasting groups of males and fe- 
males, college preparatory and non- 
college preparatory high school stu- 
dents, and college and high school 
students. 

REFERENCES 

Anastasi, A. Psychological testing. New
    York: Macmillan, 1954.

Erikson, E. H. Youth and the life cycle.
    Children, 1960, 7(2), 43-49.

Freud, S. New introductory lectures on psychoanalysis.
    New York: Norton, 1933.

Fromm, E. Man for himself. New York:
    Rinehart, 1947.

Horney, K. Our inner conflicts. New York:
    Norton, 1945.

Maslow, A. H. Motivation and personality.
    New York: Harper, 1954.

Sarbin, T. R. Contributions to role-taking
    theory: III. A preface to a psychological
    analysis of the self. Psychol. Rev.,
    1952, 59, 11-22.

Tryon, R. C. Cluster analysis. Ann Arbor:
    Edwards, 1939.


(Received July 13, 1960)


Journal of Educational Psychology 
1961, Vol. 52, No. 3, 132-135 


MANIFEST ANXIETY AND ACHIEVEMENT 
TEST PERFORMANCE 


HAYNE W. REESE


University of Buffalo 


According to Spence’s (1956) theory 
the effect of motivational or drive 
level on performance depends on the 
complexity of the task. In a “simple” 
task, in which the correct response 
tendency is stronger than competing 
response tendencies, subjects with 
higher drive perform at higher levels 
than subjects with lower drive; but in 
a “complex” task, in which the correct 
response tendency is weaker than com- 
peting response tendencies, they per- 
form at lower levels than subjects 
with low drive. These effects depend 
on the energizing property of drive. 
Castaneda (1956) has suggested that 
the inverse relationship between drive 
level and performance may also be 
obtained in simple tasks, such as a 
reaction time situation, in which the 
correct response is mediated by im- 
plicit verbal responses. Theoretically, 
a drive produces “drive stimuli” which 
may be conditioned to responses. If 
these drive-produced responses inter- 
fere with the implicit verbal responses 
which mediate the instrumental act, 
the task becomes a complex one and 
there will be more interference under 
high drive, since the drive-produced 
responses have a greater probability 
of occurrence, than under low drive. 
The Children’s Manifest Anxiety 
Scale (CMAS) was designed by Cas- 
taneda, McCandless, and Palermo 
(1956) as a measure of drive in chil- 
dren. Relationships have been demon- 
strated between CMAS scores and 
school achievement (McCandless & 
Castaneda, 1956) and performance on 
simple and complex learning tasks 
(Castaneda, Palermo, & McCandless,
1956; Palermo, Castaneda, &
McCandless, 1956). These studies showed
that children with high CMAS scores
are superior to low CMAS subjects
on simple tasks and inferior on complex
tasks, in line with the interpretation
of CMAS scores as indicants of
drive level.

It has been suggested that the inverse
relationship between CMAS
scores and performance on complex
tasks may result from the negative
correlation between CMAS scores and
intelligence which has been found
(Hafner & Kaplan, 1959; McCandless
& Castaneda, 1956; Phillips, King, &
McGuire, 1959). Phillips et al. (1959)
reported that when the effects of intelligence
were partialed out, there
was no relationship between CMAS
scores and performance on a battery
of psychometric tests differing in
"complexity."

The present study was designed to
provide additional evidence regarding
the relationships among manifest anxiety,
intelligence, and performance.
An arithmetic test was given under a
time limit, and it was assumed that
for fourth and sixth grade school children
the test would be more nearly a
speed test than a power test, i.e., that
it would be a simple task in which
speed was emphasized (as in the reaction
time situation). It was further
assumed that the motor response
(writing the answers) would be verbally
mediated, and that manifest
anxiety would produce responses
which would interfere with the mediators
(i.e., the symbolic processes).
Therefore, the inverse relationship between
manifest anxiety and performance
was predicted.


METHOD


Subjects. The subjects were 539 fourth
and sixth grade school children from the
Williamsville, New York, school district.
The data of an additional 85 subjects were
not used: [number illegible] subjects missed one of the
tests, and IQs were not available for 31
subjects.

Procedure. The subjects were
given the CMAS and one week later an
arithmetic test consisting of 40 addition
problems, each with four three-digit numbers.
The arithmetic test had a 13-minute
time limit. Both tests were administered by
the teachers to their regular classes, using
standard instructions. Scores on the Primary
Mental Abilities Test, which is routinely
administered in the third and sixth grades,
were obtained from school records.


RESULTS

Table 1 presents mean scores on the
measures and provides further information
about the sample and the difficulty
level of the achievement test.
(Difficulty level is not necessarily related
to "complexity" of the test, since
the correct response may be dominant
in the response hierarchy but have a
low probability of occurrence and
a long latency. Under these
conditions, low scores would result
when the time limit is relatively
short.) The relations among the variables
are presented in Table 2.² Manifest


¹ The writer is indebted to W. E. Keller,
Supervising Principal, Williamsville Central
Schools; L. R. Cofran, Principal, Academy
Elementary School; R. A. Wilson, Principal,
[illegible] Elementary School; K. L. Merrick,
[illegible] Forest Elementary School; and E.
[illegible], Principal, Maple Elementary School,
and their teaching staffs, for their fine cooperation
in providing facilities and for their
assistance in many other ways. The achievement
test was constructed, and the instructions
standardized, by [illegible] Cutting, Buffalo,
New York, whose assistance is gratefully
acknowledged.

² Castaneda (1956) suggested that there
may be two kinds of anxiety, "verbal" and
"nonverbal" or "somatic." The former should
TABLE 1
MEAN IQ, CMAS SCORES, AND ARITHMETIC TEST SCORES

                                      Fourth Grade          Sixth Grade
Measure                             Females    Males     Females    Males

N                                     139       144        118       138
IQ                                   112.4     108.6      112.3     108.7
Anxiety Score                         15.8      14.4       14.9      13.3
Lie Score                              5.0       4.3        3.4       3.5
Number Right on Arithmetic Test       11.8      10.9       21.9      20.6
Number Wrong on Arithmetic Test        6.2       6.8        4.3       4.2


anxiety was negatively correlated with
the number of correct answers on the
achievement test, though the correlation
was not significant for fourth
grade males. To provide further information
about the relationship between
manifest anxiety and achievement,
the subjects of each sex in each
grade were divided into six subgroups
on the basis of CMAS scores, and their
achievement test scores were compared.
The main effect of Anxiety
Levels was significant (F = 6.73; df =
5,464; p < .001), but this factor did
not interact significantly with grade
or sex, indicating that there was a significant
relationship between manifest


produce responses which interfere with mediators
in the present type of task, but the
latter should not. Verbal and somatic subscales
were constructed from the CMAS,
and it was predicted that the correlations
with achievement [illegible]
positive for the latter, since
[illegible] produces responses [illegible]
mediators. The average correlations with
achievement were -.20 for verbal anxiety
and -.15 for somatic anxiety. The failure
to confirm the prediction may reflect a lack
of validity of the subscales or a flaw in the
theoretical analysis.


TABLE 2
PEARSON PRODUCT-MOMENT CORRELATIONS
AMONG CMAS SCORES, IQ, AND NUMBER
RIGHT ON ACHIEVEMENT TEST


Group 
_ eariables, | Fourth Grade | Sixth Grade 
Females | Males | Females | Males 
IQXR 408" | g7ee 5gtt* | ageee 
Anxiety XR |—.a3ee* |—115 19° 
Lie X R —.lL |=.a2ee * |=.10 
Anxiety ¥1Q_ |-.23* |—.01 |—.18* | 03 
Lie X 1Q —.14 — |—.a5e9 | gaeee | ize 
Anxiety XLie | .00  |—.2gee¢|—.05 — |—_agese 


Note. R refers to Number Right on the achievement
test.

  * p < .05.
 ** p < .025.
*** p < .01.


TABLE 3
PARTIAL CORRELATIONS BETWEEN NUMBER
RIGHT AND MANIFEST ANXIETY, WITH
IQ PARTIALED OUT AND WITH LIE
SCORE PARTIALED OUT

Variable               Fourth Grade          Sixth Grade
Partialed Out        Females    Males     Females    Males

IQ                    -.26      -.20       -.14      -.23
Lie Score             -.383     -.22       -.25      -.22


anxiety and performance on the 
achievement test, but the trends were 
not significantly different in the dif- 
ferent sexes or grades. The eta for the 
combined sexes and grades was about 
.21, which was about the same as the 
mean of the Pearson correlation co- 
efficients between anxiety and achieve- 
ment (cf. Table 2). Therefore, it was 
concluded that the relationship between
anxiety and achievement is
negative and linear.
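
The linearity argument compares eta (the correlation ratio, which captures any relationship between anxiety level and achievement) with the Pearson coefficient (which captures only the linear part). When the two are about equal, little nonlinear trend remains. The sketch below illustrates that comparison; the function names and the toy scores are assumptions introduced for the example and are not data from the study.

```python
import math


def correlation_ratio(groups):
    """Eta: sqrt(between-group sum of squares / total sum of squares)."""
    scores = [x for group in groups for x in group]
    grand_mean = sum(scores) / len(scores)
    ss_total = sum((x - grand_mean) ** 2 for x in scores)
    ss_between = sum(
        len(group) * (sum(group) / len(group) - grand_mean) ** 2
        for group in groups
    )
    return math.sqrt(ss_between / ss_total)


def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length lists."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Toy data: three ordered anxiety levels with achievement scores.
    groups = [[4, 3], [3, 2], [2, 1]]
    levels = [i for i, g in enumerate(groups) for _ in g]
    scores = [x for g in groups for x in g]
    eta = correlation_ratio(groups)
    r = pearson_r(levels, scores)
    # eta**2 - r**2 is the share of variance due to nonlinear trend.
    print(eta, r, eta ** 2 - r ** 2)
```

Eta can never be smaller in magnitude than r for the same grouping, so a difference near zero is the evidence for linearity.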

Examination of Table 2 shows that
manifest anxiety was more strongly
correlated with achievement test
scores than with IQ. Table 3 presents
partial correlations between anxiety
and achievement, with IQ partialed
out, and with lie score partialed out.
Partialing out IQ tended to reduce
correlations for females by about [illegible]
and to increase the correlations for
males by about .04. Partialing out the
lie score increased the correlation an
average of about .03.

The multiple correlations [illegible] predicting
performance from [illegible] the
correlations for IQ [illegible]
combination of anxiety [illegible]
improved prediction, compared [illegible]
anxiety alone, by an average of [illegible]
.07. The use of anxiety, IQ, and lie
score improved prediction, compared
with IQ alone, by an average of about
.04, and compared with anxiety and
IQ, by only about .01.

The lie score was significantly negatively
correlated with achievement for
fourth grade males and sixth grade females.
Partialing out IQ reduced the
correlations to .03 and -.19, respectively.
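
For readers who want to see how "partialing out" and "multiple correlation" work arithmetically, the sketch below gives the standard first-order partial correlation and the two-predictor multiple correlation, both computed from zero-order correlations. The function names and the illustrative coefficients are assumptions; the source does not show its computations, and the example values are not taken from Table 2.

```python
import math


def partial_corr(r_xy: float, r_xz: float, r_yz: float) -> float:
    """Correlation of x and y with z held constant (first-order partial)."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def multiple_r(r_y1: float, r_y2: float, r_12: float) -> float:
    """Multiple correlation of y with two predictors, 1 and 2."""
    r_squared = (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)
    return math.sqrt(r_squared)


if __name__ == "__main__":
    # Illustrative values: anxiety-achievement, anxiety-IQ, IQ-achievement.
    r_anx_ach, r_anx_iq, r_iq_ach = -0.20, -0.20, 0.50
    print(partial_corr(r_anx_ach, r_anx_iq, r_iq_ach))
    # Gain in prediction from adding anxiety to IQ.
    print(multiple_r(r_iq_ach, r_anx_ach, r_anx_iq) - abs(r_iq_ach))
```

The second printed value shows the kind of comparison the text makes: how much the multiple correlation exceeds the correlation for IQ alone.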


Discussion 


The data indicate that the relationship
between manifest anxiety and
performance on a task requiring mediation
is monotonic and negative, but
that although statistically significant,
it is not a strong relationship. Partialing
out IQ had little effect on the
correlations, but on the other hand the
combination of manifest anxiety with
IQ did not appreciably increase the
accuracy of predicting performance
over that for prediction on the basis of
IQ alone. Therefore, it must be concluded
that although the relationship
between manifest anxiety and performance
is not a result of the
correlation between anxiety and intelligence,
manifest anxiety has relatively
little influence on scores on an
achievement test used in
the present study.

The lie score does not seem to serve
a useful purpose as far as prediction
of performance on the present type of
task is concerned, though it does appear
that there is a tendency to falsify
[illegible], at least in males.


SUMMARY

Fourth and sixth grade children
were given the Children's Manifest
Anxiety Scale and an arithmetic
achievement test, consisting of 40 addition
problems, which was given with
a time limit. It was found that manifest
anxiety was inversely and monotonically
related to the number of correct
answers on the achievement test.
Partialing out IQ had little effect on
the relationship between manifest anxiety
and achievement, but predictions
of achievement were not appreciably
improved by the combination of manifest
anxiety with IQ.
REFERENCES 
Castaneda, A. Reaction time and response
    amplitude as a function of anxiety and
    stimulus intensity. J. abnorm. soc. Psychol.,
    1956, 53, 225-228.

Castaneda, A., McCandless, B. R., &
    Palermo, D. S. The children's form of
    the Manifest Anxiety Scale. Child Develpm.,
    1956, 27, 317-326.

Castaneda, A., Palermo, D. S., & McCandless,
    B. R. Complex learning and performance
    as a function of anxiety in
    children and task difficulty. Child Develpm.,
    1956, 27, 327-332.

Hafner, A. J., & Kaplan, A. M. Children's
    manifest anxiety and intelligence. Child
    Develpm., 1959, 30, 269-271.

McCandless, B. R., & Castaneda, A. Anxiety
    in children, school achievement, and
    intelligence. Child Develpm., 1956, 27,
    379-382.

Palermo, D. S., Castaneda, A., & McCandless,
    B. R. The relationship of anxiety
    in children to performance in a complex
    learning task. Child Develpm., 1956, 27,
    333-337.

Phillips, B. N., King, F. J., & McGuire, C.
    Studies on anxiety: I. Anxiety and performance
    on psychometric tests varying
    in complexity. Child Develpm., 1959,
    30, 253-259.

Spence, K. W. Behavior theory and conditioning.
    New Haven: Yale Univer.
    Press, 1956.


(Received July 25, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 3, 136-147 


CREATIVE AND ACADEMIC PERFORMANCE 
AMONG TALENTED ADOLESCENTS 
JOHN L. HOLLAND²


National Merit Scholarship Corporation, Evanston, Illinois 


The need to understand the nature 
of academic achievement and creative 
behavior is pressing; such knowledge 
is vital for the proper development of 
educational practice, the administra- 
tion of scholarship and fellowship pro- 
grams, and the fostering of talented 
behavior generally. Unless attempts 
to encourage the development of crea- 
tive talent rest on valid assumptions 
about the nature of academic and 
creative behavior, such efforts will 
have little useful influence. 

The present study is one of several 
investigations planned to explore the 
nature of academic achievement and 
creative behavior in adolescents and 
young adults (Holland, 1959a, 1959b, 
1960a). The earlier studies were concerned
primarily with academic performance.
This study tests a large
number of hypotheses about variables
which are often assumed to be associated
with both academic and creative
achievement: aptitude scores,
self and teacher ratings, originality
measures, parental attitudes and values,
vocational interests and aspirations,
and background information.
Most of the variables were used to test
a limited number of hypotheses; a few
were included for exploratory purposes.

¹ This study was partially supported by
research grants from the National Science
Foundation and the Old Dominion Foundation.

² I am indebted to the following staff
members for their skillful assistance in all
phases of this project: Virginia Chalmers,
Laura Kent, Donald Thistlethwaite, and
Elizabeth van Laer.

The hypotheses about the nature
of the academic achiever were
derived in part from an extensive literature
which suggests that the student
who gets good grades in high
school is bright, persistent, [illegible],
self-controlled, responsible, [illegible],
and rated high by his teachers
(d'Heurle, Mellinger, [illegible];
Gough, 1953; Holland, 1959a,
1959b). Drews and Teahan (195[illegible]) indicate
that the academic achiever
tends to have a somewhat authoritarian
mother. Several recent studies
(Holland, 1960a; MacKinnon, 1959)
imply also that good grades in high
school and college may be either unrelated
to, or negatively correlated
with, potential for creative performance.
Taken together, these findings
have somewhat consistent implications
about the academic achiever and his
parents' attitudes and values. The hypotheses
derived from the literature
and tested here are summarized as follows:

1. Outstanding academic achievement
(High School Rank, HSR)
will be positively associated with high
scores on the Scholastic Aptitude Test
(Math and Verbal factors) (SAT-M
and SAT-V), and on the Mastery,
Deferred Gratification, and Control
scales; with high Self-Ratings on
Drive to Achieve and Perseverance;
with high Teacher Ratings of Maturity,
Popularity, and Social Leadership;
with Fathers' Values of Good
Student; and with mothers' authoritarian
attitudes as measured by the
Parental Attitude Research Inventory
(PARI).

2. Outstanding academic achievement
will be associated with low scores
on measures of creativity and originality
(Differential Reaction Schedule,
Complexity-Simplicity, Independence
of Judgment, and Barron
Originality scales) and with low
self-evaluations.

The hypotheses about the nature of
the creative student were drawn principally
from the work of Barron
([illegible], 1953b), Gough (1957a), and
MacKinnon (1959), which depicts the
creative person as independent, complex
in [illegible], curious, self-assured,
[illegible], interested in science and
art, [illegible] effective. This characterization
implies also a more permissive
parental background which
[illegible] independence and the exploration
of self and environment. The
hypotheses derived from this literature
are summarized as follows.

1. Creative performance (winning
public [illegible] which demand
creative [illegible]) will be positively associated
with high scores on measures of
creativity and originality (Differential
Reaction Schedule, Complexity-Simplicity,
Independence of Judgment,
and Barron Originality scales), and
on the Initiative and Self-Assurance
scales; with high Self-Ratings of
[illegible], Independence, and Perseverance;
with Fathers' Values of
[illegible] Independent; and with
mothers' nonauthoritarian attitudes.

2. Creative performance will have
no correlation, or negative correlations,
with good high school grades;
with high Self-Ratings of Popularity;
with high Teacher Ratings; and with
Fathers' Values of Dependable, Good
Student, and Well-Adjusted, [illegible].

METHOD


Student and Parent Sample


The student and parent samples were obtained
from a one-sixth random sample of
[illegible] Merit Finalists (approximately
[illegible] high-scoring students by state from an
initial pool of 478,991 high school juniors),
who were polled and tested by mail. An
84% return was obtained, but incomplete
information reduced the sample to 59%, 649
boys and 345 girls, and their parents. The
average aptitude levels of these samples on
the SAT were as follows. For boys, the SAT-V and
SAT-M means are 658.4 and 697.9 with SDs of 56.8
and 62.2, respectively. For girls, the SAT-V
and SAT-M means are 659.2 and 635.4 with
SDs of 60.6 and 70.3.


Academic and Creative Performance 


The criterion of academic performance 
was high school grades (HSR) during the 
first 3 years of high school. The criteria of 
creative performance were derived from a 
checklist of accomplishments assumed to re- 
quire creative or original behavior. Creative 
performance is defined as a performance 
which is accorded public recognition through 
awards, prizes, or publication, and which may 
therefore be assumed to have exceptional 
cultural value. Because of the difficulty in 
arriving at a generally acceptable definition 
of “creativity,” these criteria should perhaps 
be regarded as either “notable scientific or 
artistic performance,” although we will refer 
to the criterion as “creative” performance 
hereafter to enhance readability. With this 
definition as a guide, a list of 20 achieve- 
ments at the high school level was derived 
by reviewing the secondary school achieve- 
ments of Finalists from previous years. 
Items were divided by content into two 
scales: Creative Science (5 items) and Cre- 
ative Arts (11 items). (Four of the original 
20 items were omitted because they ap- 
peared to be inadequate signs of creative be- 
havior.) The items on the two scales are 
shown below. Students were asked to check 
those which applied to them. 

Creative Science Scale

1. Gave an original paper at a scientific
   meeting sponsored by a
   professional society.
2. Won a prize or award in a scientific
   talent search.
3. Constructed scientific apparatus
   on own initiative.
4. Invented a patentable device.
5. Had scientific paper published in
   a science journal.

Item 3 does not meet the criteria of
"public recognition" and "unusualness," but
in order to lengthen the scale, it was included
as a low-level sign of creative performance.
Similarly, Item 4 depends on the
student's judgment, and may or may not be
an accomplishment which has received public
recognition.


Creative Arts Scale

1. Won one or more speech contests.
2. Had poems, stories, or articles
   published in a public newspaper
   or magazine (not school paper)
   or in a state or national high
   school anthology.
3. Won a prize or award in an art
   competition (sculpture, ceramics,
   painting, etc.).
4. Received the highest rating in a
   state music contest.
5. Received one of the highest ratings
   in a national music contest.
6. Composed music which has been
   given at least one public performance.
7. Arranged music for public performance.
8. Had minor roles in plays (not
   high school or church-sponsored).
9. Had leads in high school or
   church-sponsored plays.
10. Won literary award or prize for
    creative writing.
11. Had cartoon published in public
    newspaper or magazine, etc.
    (not high school paper).


For boys, the estimated reliabilities
(Kuder-Richardson Formula 20) of the Arts
and Science scales, respectively, are .36 and
.55. For girls, the reliabilities are .38 and .37.
These low reliabilities may be attributed to
the brevity of the scales; they limit the level
of correlation between the predictors and
these criteria. The Arts and Science scales
correlate with one another .15 and .09 for
boys and girls, respectively.
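
The Kuder-Richardson Formula 20 reliability reported for the checklist scales can be illustrated with a short Python sketch. The function name and the toy response matrix are assumptions made for the example. The sketch uses the population variance of total scores, which is one common convention; the source does not say which was used.

```python
def kr20(responses):
    """Kuder-Richardson Formula 20 for dichotomous (0/1) items.

    `responses` is a list of rows, one per student, each row a list of
    0/1 item scores. Uses the population variance of total scores
    (an assumed convention).
    """
    n_students = len(responses)
    n_items = len(responses[0])
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n_students
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_students
    # Sum over items of p * q, where p is the proportion checking the item.
    sum_pq = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in responses) / n_students
        sum_pq += p * (1 - p)
    return (n_items / (n_items - 1)) * (1 - sum_pq / var_total)


if __name__ == "__main__":
    # Toy data: four students, three checklist items.
    toy = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
    print(kr20(toy))
```

With only 5 or 11 items that few students check, the item variances are small relative to what a longer scale would give, which is consistent with the low values the text attributes to scale brevity.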


Predictors 


Students and parents were sent the fol- 
lowing inventories and scales. Students filled 
out: 

1. The Gough (1957b) Differential Reaction Schedule (DRS), an inventory which was designed to predict originality and which contains the following scales: Intellectual Competence, Inquiringness, Cognitive Flexibility, Esthetic Sensitivity, Sense of Destiny, Total Score (sum of all scores), and Potential Success (PIV), an index of general [illegible] and ambition derived from a variety of Institute of Personality Assessment and Research (IPAR) studies of achievement and personal effectiveness.
2. The Complexity-Simplicity, Independence of Judgment, and Originality scales from Barron’s (1952) Inventory of Personal Philosophy. Only 54 of the 150 items [illegible] Originality scale [illegible]. The Complexity-Simplicity scale consists of [illegible] items from the original 900-item IPAR inventory, selected for [illegible] correlations [illegible]; the scale correlated .7 with the Figure Preference scale. The Originality scale was developed out of an MMPI-CPI pool by correlating each item with a composite score [illegible] measures of originality (Guilford’s Unusual Uses, Consequences, and Plot Titles tests and the IPAR Word Rearrangement test) in a sample of 343 military officers. It was cross-validated by correlating it with a composite of the three Guilford tests in a sample of 450 undergraduates at the University of California.³

3. The Mastery scale, a scale derived from three items on Strodtbeck’s Value Scale (1958).

4. The Deferred Gratification scale, formerly called the Play scale, from the National Merit Student Survey (Holland, 1960a).

5. The Vocational Preference Inventory (VPI), a short revision of the Holland Vocational Preference Inventory (Holland, 1958).

6. The Ghiselli Self-Description Inventory. Since the correlations between weighted and unweighted scale scores for the Initiative, Self-Assurance, and Occupational Level scales ranged from .89 to .96, the items of these scales were scored with equal weights rather than being weighted differentially.

7. Self-ratings on a four-point scale and a Self-Evaluation score based on the number of times the student rates himself above average.

8. The Creative Activities scale (CAS), [illegible] hobbies and activities assumed to [illegible] original behavior.

Teacher Ratings of Citizenship, Popularity, and Social Leadership were also available for each student, as well as SAT scores, level of degree sought, Birth Order, and a Breadth of Interest score based on the number of activities and interests checked from a list of [illegible].

Mothers filled out the PARI (Schaefer & Bell, 1958), and 16 of the 23 five-item scales [illegible]. Fathers gave information on family [illegible] and ranked nine goals and traits in the order in which they considered them valuable for their children to possess.

³ Frank Barron, personal communication, June 1960.

RESULTS


Correlational analysis. The 75 predictors and criteria were intercorrelated (product-moment) and the complete 75 x 75 matrices for boys and girls, along with the means and standard deviations, have been deposited with ADI.⁴ Tables 1 and 2 show the correlations between the three criteria of academic and creative performance and the 72 predictive variables. The influence of intelligence has been partialed out, using the SAT-M score as an estimate of intelligence for all three criteria, since the math score has higher correlations with the three criteria than does the verbal score. Generally, SAT-M has low correlations with the predictors. The largest absolute difference between corrected and uncorrected r’s was only .05.

⁴ The following tables have been deposited with the American Documentation Institute: Tables A and B, Correlational matrices for boys and girls; Tables C and D, Means and standard deviations for boys and girls on 75 variables; and Tables E and F, Daydreams about future occupations for students who are high and low in academic and creative performance. Order Document No. 6611 from ADI Auxiliary Publications Project, Photoduplication Service, Library of Congress; Washington 25, D. C., remitting in advance $1.75 for microfilm or $2.50 for photocopies. Make checks payable to: Chief, Photoduplication Service, Library of Congress.
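
The partialing step described in this section can be sketched with the standard first-order partial correlation formula. This is an illustration only: the function name and the input values are hypothetical, and the paper does not state the computational procedure that was actually followed.

```python
from math import sqrt


def partial_r(r_xy, r_xz, r_yz):
    """Correlation of x and y with z held constant (first-order partial r).

    Here x would be a predictor, y a criterion, and z the SAT-M score.
    """
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical values: a predictor-criterion r of .30, with SAT-M correlating
# .10 with the predictor and .09 with the criterion.
print(round(partial_r(0.30, 0.10, 0.09), 3))
```

When the control variable correlates only weakly with both the predictor and the criterion, as in this hypothetical case, the corrected coefficient differs from the uncorrected one by well under .05.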

For boys, a review of the correla-
tions at the 1% level reveals that the
three kinds of performance (scientific, 
artistic, and academic) are associated 
with somewhat different variables, 
generally in expected directions. 

The boy with a high score on the 
Creative Science Scale also has many 
artistic achievements (3) and Crea- 
tive Activities (8), plans to get an 
advanced degree (71), is a first-born 
or an only child (74), and has high 
scores on the Independence of Judg- 
ment (17), Mastery (19), Deferred 
Gratification (20), Initiative (23), 
Physical Activity (28), and Intellec- 
tuality (29) scales and low scores on 
the Responsibility (30) and Status 
(36) scales. He rates himself high on 
Originality (37), Independence (40), 
and Perseverance (42). Fathers of 
boys high on scientific performance 
regard curiosity (48) as a valuable 
trait for their sons to have; mothers 
of these boys tend to be agreeable 
(supposedly comfortable in their role 
as mothers) rather than irritable (59). 

The boy with a high score on the 
Creative Arts Scale engages in many 
Creative Activities (8), plans to get 
an advanced degree (71), and has high 
scores on the Esthetic Sensitivity (12), 
Sense of Destiny (13), DRS (Total 
Score) (14), Breadth of Interest (21), 
Self-Evaluation (22), Self-Assurance 
(24), Occupational Level (25), Re- 
sponse Bias (Acquiescence) (26), Re- 
sponsibility (30), Verbal Activity 
(32), Emotionality (33), Control 
(34), and Status (36) scales, and low 
scores on SAT-M (5). He rates him- 
self high on Originality (37), Drive to 
Achieve (39), Independence (40), 
Self-Confidence (41), and Persever- 
ance (42). Mothers of these boys are 
slightly more accepting of the home- 
making role than are mothers of stu- 


TABLE 1

CORRELATIONS BETWEEN THREE CRITERIA OF ACADEMIC AND CREATIVE PERFORMANCE
AND 72 APTITUDE, PERSONALITY, AND BACKGROUND VARIABLES FOR BOYS
(N = 649)

Variable | Creative Performance: Scientific | Creative Performance: Artistic | High School Grades
(the same four columns are repeated side by side for Variables 39-75)
28** 
1, HSR 00 08 —  ||39. Drive to Achieve 08° or 06 
2. Scientific Performance - 15 00 || 40. Independence 6° l| dageell) 28 
3. Artistic Performance 15** - 08 41. Self-Confidence 10° a 7 oi" 
4. SAT-V 02 —09* | —04 42. Perseverance ut 8 
5. SAT-M os | —19°*| 09° 
6. Humanities Comprehen- | —01 03 02 Teacher Ratings oo" 26°° 
sion 43. Citizenship - 00 05 a7 
7. Scientific Comprehension | 06 02 04 = || 44. Popularity o1 of 19°° 
8. Creative Activities 36** | 37** | —03 || 45. Social Leadership 00 7 
(NMSS) 
Father’s Values and Goals 0 —02 
Gough DRS 46, Defend Self —1 ) oy | 
9. Intellectual Competence | —03 02 | —01 147. Ambitious 06 2 | 00% 
10. Inquiringness oo | 02 02 |/48. Curious aur? | BEF 9 
11. Cognitive Flexibility 02 01 | —09* || 49. Dependable =e | = 0 16° 
12. Esthetic Sensitivity o1 | 12*| 01  ||50. Good Student —02 | % | 95 
13. Sense of Destiny 03 23¢¢| —04 ||51. Happy, Well-Adjusted | —02 a -33"* 
14. Total Score (Above 5) 03 15** | —03 || 52. Independent, Self-Reliant | —02 a 09° 
15. Potential Success (PIV) 01 04 01 (|| 53. Popular —O01 oo | 08 
5 54, Self-Controlled 01 
riginality-Personality 
16. Complexity-Simplicity 07 07 PARI 05 
17. Independence of Judgment} 10* 03 —11** || 55. Fostering Dependency -01 a 10° 
18. Barron Originality 07 08* | —12** || 56. Seclusion of Mother oe | See 09° 
19. Mastery 1*| 10° 13°* || 57. Martyrdom OR 03 
20, Deferred Gratification 11°* 06 11°* || 58. siitanka -07 02 
21, Breadth of Interest oo | 23%] 00 |/59. Irritability aah 06 
22, Self-Evaluation o1 23** | 24** || 60, Excluding Outside a 
Influences 08" 
Ghiselli 61. Suppression of Aggression | 07 | “OF | gg 
23. Initiative 1%*| 07 | —02 || 62. Rejection of Homemaking | —02 
24, Self-Assurance oo" 1a lise Role 02 
25. Occupational Level 01 ui | —07 — |\63. Equalitarianism —05 os 06 
64. Approval of Activity oo: | 0 42°° 
VPI 65. Avoidance of Communica- | 05 8 
26. Response Bias —02 13** | 05 tion 05 
27. Infrequency -07 | —01 03/65. Suppression of Sex 03 | —04 05 
28. Physical Activity ise | —04 02 || 67. Ascendency of Mother —05 | —0" | os 
29. Intellectuality age | 03 07 ~— || 68. Intrusiveness 10* 03 06 
30,, Responsibility —i7** | 20°*| 05 | 69. Comradeship and Sharing | —07 | —02 40* 
31. Conformity 07 a * 70. Avocleration of Develop-| 07 | —0 
32. Verbal Activity —10* men: 
33. Emotionality —05 sic | 02 
34. Control —02 12°* 15** || Miscellaneous 
35. Masculinity-Vemininity | 10* |—05 | 03 |\71. Degree (level) Sought ise | 13%] 08, 
36. Status —1g¢* | 18°* | 12° }|72. Father’s Educational Level| 05 os | ~ 8 
78. Mother's Educational ~02 os | -1° 
Self-Ratings i 
37. Originality ise | 27¢* | —01__ || 74. Birth Order (frat born) oor | —04 o1 
38. Popularity —05 10° 12** || 75. School Size = 03 
oP’ 02 —04 


* Significant at the .05 level. 
** Significant at the .01 level. 
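
As a reading aid for the significance markers, the following sketch estimates the smallest correlation that reaches a given two-tailed level for a sample of size N. It relies on a normal approximation to the t distribution, which is an assumption of this sketch and is reasonable only for samples as large as these; the function name and the `n_partialed` parameter are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def critical_r(n, alpha, n_partialed=0):
    """Approximate smallest |r| significant at two-tailed level alpha.

    Uses the normal approximation to t; `n_partialed` is the number of
    variables held constant (0 for a zero-order correlation).
    """
    df = n - 2 - n_partialed
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return z / sqrt(z * z + df)


for n in (649, 345):
    print(n, round(critical_r(n, 0.05), 3), round(critical_r(n, 0.01), 3))
```

Under this approximation, coefficients of roughly .08 and .10 are needed at the .05 and .01 levels for the boys (N = 649), and roughly .11 and .14 for the girls (N = 345).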




TABLE 2

CORRELATIONS BETWEEN THREE CRITERIA OF ACADEMIC AND CREATIVE PERFORMANCE
AND 72 APTITUDE, PERSONALITY, AND BACKGROUND VARIABLES FOR GIRLS
(N = 345)

Variable | Creative Performance: Scientific | Creative Performance: Artistic | High School Grades
(the same four columns are repeated side by side for Variables 39-75)
1. Hi _——s 
ie oe 00 | —o9 | — _ |/39. Drive to Achieve ise | age | 18" 
3, pear Petformatios — | 09 | —03  |/40. Independence o7 | 12° | -08 
Lae o9 | — | 09  |/41. Self-Confidence os | ue | 1 
5. SATA 07 0 08 || 42. Perseverance age | 13e | 22° 
Ge Mamantts 13* | —12% | 24** 
Humanities Comprehen- | 03 | 05 | 07 ‘Teacher Ratings 
7. Scienti 43. Citizenship 05 | —04 30°* 
i cuitntifie Comprehension | 1* | 01. | 05 44, Popularity -o2 |—o4 | 23° 
eae ’ Lal (ioe i i 0 23¢* 
(WMss) a3e+ | gate | —19** || 45. Social Leadership on 4 
Go} Father’s Values and Goals 
Hibnteey 46. Defend Self os | oo | 00 
10. Tn lectual Competence ~14** | —02 01 47. Ambitious 13° 05 08 
AGaiiee oo | 02 | 03  |/48. Curious os | 03 | —16% 
WB ee oo | 09 | —12* |/49. Dependable 1s] 02 | 16 
18, gent” Sensitivity os | 05 | 01  ||50. Good Student ut | os | ot 
16 ee ot Destiny 03 og | 04 ||1. Happy, Well-Adjusted — | —15* oo | -02 
15, Pott Score (Above 5) —0r tase | 03 |/52. Independent, Self-Reliant | 00 —09 | -01 
* “otential Success (PIV) | —04 00 00 = /53. Popular -u* | -04 | 00 
Original 54, Self-Controlled —05 | -0 02 
16 Rnality-Personality 
Ce sie Aion o2 | o6 | —14** || PARE 
18. B lependence of Judgment} 06 02 | —11* ||55. Fostering Dependency —02 | —10 o4 
19, Been Originality 07 | 10 | —05 _ |]50. Seclusion of Mother “03 | o2 | 18 
Piarewraant 1 | 02 | 08 |/57. Martyrdom —o38 | 05 | 08 
21, Raced Gratification ue | 03 14** |[58. Strictness 03 | 01 07 
2, greadth of Interest ise* | 24%¢ | 09 (159. Irritability —0s |-07 | 03 
* Self-Bvaluation 08 tore | 13% |] 60. Excluding Outside —o1 | —10 u 
ii Influences 
pti 61. Suppression of Aggression —06 —06 16** 
24, initiative oa | o7 | 03. |[62- Rejection of Homemaking -o5 |—05 | 06 
3, gc Assurance o3 | oo | 15% |] Role woolen 
* Occupational Level o1 10 01 |/63. Equalitarianism —o1 | —0 La 
Vey 64, Approval of Activity —o1 | -06 
26 § 65. ‘Avoidance of Communics- oo | -07 13 
7 aponse Bins 00 05 09 tion 
2 ; sf 7 
dy, Enfrequeney os | —08 | 06 _|/6s. Suppression of Sox 0s =m, 10 
99, physical Activity os | —o¢ | —03  |/67. Ascendency of Mother | -O |“ | OP 
30, ntellectuality 08 | tt | nop fas. Intrusiveness ine} os | 1s | 00 
a1, Responsibility 02 06 06 || 69. Comradeship and Shaving | 77 | “oy Des 
32, Conformity 03 | -07 02 ||70. Acceleration of DeveloP- 
ot Verbal Activity —06 06 * ment 
34, Gmotionality —02 10 07 as 
* ~ontrol o4 =| —O1 14** || Miscellaneous eA any 
35, Masculini (level) Sought 19 | 09 
culinity-Femininity 06 06 —02 71. Degree : ib aa 
59. Status —o2 | o2 | 10% |/72. Father's Eduestionsl Level] 08 | 12, | 
Se 73. Mother’s Education 
te Ratin Level s 
s Originality 13* 32** | —03 74, Birth Order (first born) eH ae Hs 
: Popularity —01 10 04 |175. School Size 
* Significant at the .05 level.
** Significant at the .01 level.


dents with low artistic performance 
(62).
The variables related to academic
performance are somewhat different 
from those related to creative per- 
formance. Academic performance 
(HSR) is negatively correlated with 
high scores on the Complexity-Sim- 
plicity (16), Independence of Judg- 
ment (17), and Barron Originality 
(18) scales and positively correlated 
with high scores on the Mastery (19), 
Deferred Gratification (20), Self- 
Evaluation (22), Self-Assurance (24), 
Control (34), and Status (36) scales. 
Boys with high grades rate themselves 
high on Popularity (38), Drive to 
Achieve (39), Self-Confidence (41), 
and Perseverance (42). Their teachers 
rate them high on Citizenship (43), 
Popularity (44), and Social Leader- 
ship (45). Fathers of these students 
want their sons to be Good Students 
(50) and are less concerned with their 
being Independent and Self-Reliant 
(52). Mothers tend to be authoritarian:
at the 1% level, Avoidance of Com-
munication (65); at the 5% level, Se-
clusion of Mother (56), Martyrdom
(57), Suppression of Aggression (61),
and Acceleration of Development
(70).
The findings for girls, shown in Ta- 
ble 2, are similar to those for boys. 
The girl with a high score on the 
Creative Science Scale tends to en- 
gage in many Creative Activities (8), 
plans to get an advanced degree (71), 
has low scores on the Intellectual 
Competence scale (9) and high scores 
on the Mastery (19) and Breadth of In-
terest (21) scales. She rates herself
high on Drive to Achieve (39). Her 
father ranks being Dependable (49) 
low on the list of goals and traits 
which he values for his children. 
The girl with a high score on the 
Creative Arts Scale has high scores 
on the Creative Activities (8), Sense
of Destiny (13), DRS (Total Score)
(14), and Breadth of Interest (21)
scales; has a positive Self-Evaluation
(22), and rates herself high on Origi-
nality (37).

Academic performance is negatively correlated with high scores on the Creative Activities (8) and Complexity-Simplicity (16) scales and positively correlated with high scores on SAT-M (5) and the Deferred Gratification (20), Control (34), Status (36), and Self-Assurance (24) scales, with high Self-Ratings of Drive to Achieve (39) and Perseverance (42), and with high Teacher Ratings of Citizenship (43), Popularity (44), and Social Leadership (45). Fathers of girls who get high grades value being Dependable (49) and do not value being Curious (48). Mothers of these girls are characterized by the PARI as “authoritarian”: Seclusion of Mother (56), Suppression of Aggression (61), Acceleration of Development (70).

The differences between creative and academic performance for [illegible] variables replicate the trends found for boys. Academic performance appears to be the function of a personal syndrome characterized by perseverance, self-control, good behavior (good citizenship), and rigidity; this personal pattern is related to parental attitudes which seem conducive to such a pattern. Creative performance, on the other hand, seems to be the outcome of a conscious conception of being original, active participation in creative hobbies, and reinforcement by parents who possess values and attitudes which appear to be conducive to such performance.

Free response analysis. As a part of the assessment, students were asked about their vocational choice, the factors which influenced this choice, and their image [illegible] their daydreams about [illegible] occupations. Their free responses to the question, “What do you daydream about when you think about your future occupation?” were [illegible] 10 coding categories developed from a sample of 200 students [illegible] reliability on a sample of [illegible] students. Two raters, coding independently, obtained 75% agreement. Tables E and F⁴ compare the daydreams of students high on creative performance with those of students low on creative performance, and of students high on academic performance with students low on academic performance.
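
The rater-agreement figure reported above is a simple percentage of matching codes. A minimal sketch follows; the category labels and the eight coded responses are hypothetical and are not data from the study.

```python
def percent_agreement(codes_a, codes_b):
    """Percentage of responses to which two raters assigned the same category."""
    if len(codes_a) != len(codes_b):
        raise ValueError("both raters must code the same responses")
    matches = sum(a == b for a, b in zip(codes_a, codes_b))
    return 100.0 * matches / len(codes_a)


# Hypothetical codes (categories 1-10) given by two raters to eight responses.
rater_1 = [1, 4, 4, 7, 2, 10, 3, 3]
rater_2 = [1, 4, 5, 7, 2, 9, 3, 3]
print(percent_agreement(rater_1, rater_2))  # 6 of 8 codes match
```

Percentage agreement does not correct for matches expected by chance, so it tends to overstate reliability when a few categories absorb most of the responses.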
According to Table E, boys with outstanding scientific performance daydream of “high achievement and [illegible] accomplishment”; for this category the percentage difference between high and low science boys is significant at the .01 level. The same trend is observable in the arts sample, though the percentage difference is not significant at the .05 level. The free responses of students above the median on HSR do not differ significantly from those of students below the median.

Girls with outstanding scientific [illegible] daydream more about work activities than about helping others (p < .001), whereas girls with [illegible] achievement are less concerned with work and more interested in helping others (p < .05). Again, the comparisons [illegible] high and low on academic [illegible] significant [illegible].

These results, which support some of the correlational findings, imply that academic achievement involves somewhat different motives than creative [illegible]. Getting good grades in high school appears to be a function of sociability (citizenship and popularity) and perseverance, whereas creative performance is a function of conscious concern with high accomplishment, independence, and originality.

Getzels’ analysis. In a recent study
of elementary school children, Tor-
rance (1959), following Getzels’ classi-
fication of students as “high IQ” or
“high creative,” reports that the most
intelligent students (top 20%) are not
necessarily the most creative (top
20%): only about 30% of the students
were in both groups.

The present study supports Torrance’s findings. Eight tables were obtained for the present study by relating SAT-V and SAT-M to the scientific and artistic checklist scores for each sex. Within each table, distributions of creative performance scores were formed for approximately the top 20% of students on the SAT and the top 20% of students on creative performance. The percentages of overlap between these two contrasting student groups ranged from 1% to 6.5% of the total samples for the eight tables and deviate only slightly from an expected overlap of 4%. The larger deviations were in the direction expected from the correlations in Tables 1 and 2. These results are consistent with the negligible correlations between aptitude and outstanding creative achievement shown in Tables 1 and 2. We may conclude, then, that in a group of exceptionally bright students, intelligence has little or no relationship to creative performance in arts and science or to academic achievement.
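
The expected overlap of 4% follows from independence: if membership in the top 20% on aptitude is unrelated to membership in the top 20% on creative performance, the proportion of the total sample in both groups is .20 x .20 = .04. The sketch below shows that arithmetic and one way an observed overlap might be counted; the function names and scores are hypothetical, and ties are broken arbitrarily by sort order.

```python
def expected_overlap(fraction_a, fraction_b):
    """Proportion of the total sample expected in both groups under independence."""
    return fraction_a * fraction_b


def top_fraction(scores, fraction):
    """Indices of the students in the top `fraction` of `scores`."""
    k = round(len(scores) * fraction)
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return set(order[:k])


def observed_overlap(scores_a, scores_b, fraction=0.2):
    """Proportion of the total sample that is in the top group on both measures."""
    both = top_fraction(scores_a, fraction) & top_fraction(scores_b, fraction)
    return len(both) / len(scores_a)


print(round(expected_overlap(0.2, 0.2), 2))  # 0.04, i.e., 4% of the total sample

# Hypothetical scores for ten students on two measures.
aptitude = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
creative = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
print(observed_overlap(aptitude, creative))  # no student is in both top groups
```

An observed overlap near 4% is therefore what chance alone would produce, whereas a perfect relationship between the two measures would place 20% of the sample in both groups.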


These results may in part be a function of the regression of creativity on aptitude; students [illegible] initially for their high [illegible], then the relationships [illegible] aptitude and creativity are [illegible] of aptitude. These [illegible] only in part, a [illegible] narrow range on aptitude, however. In Tables A and B [illegible] SAT-V and SAT-M correlate as much as .64 and .57 with the Humanities and Scientific Comprehension scores of the National Merit Scholastic Scholarship Test (an achievement test given about one year before the SAT was administered).
Even these correlations are probably 
underestimates, due to the artificial 
restriction of range imposed on the 
NMSQT by its use as a selection de-
vice for this sample. The latter find-
ings suggest that the SAT does dis-
criminate efficiently and reliably
within this narrow range of talent so 
that higher relationships between ap- 
titude and creative as well as academic 
performance were possible. 
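
To illustrate why restriction of range leads to underestimated correlations, the following sketch applies a standard correction for direct selection on one variable (often called Thorndike's Case 2). The study does not report applying any such correction; the function name and the input values are hypothetical.

```python
from math import sqrt


def correct_for_range_restriction(r, sd_unrestricted, sd_restricted):
    """Estimate the correlation in the unrestricted group.

    `r` is the correlation observed in the selected (restricted) sample;
    the two standard deviations refer to the variable used for selection.
    """
    k = sd_unrestricted / sd_restricted
    return r * k / sqrt(1 - r * r + r * r * k * k)


# Hypothetical case: r = .57 observed where selection halved the standard deviation.
print(round(correct_for_range_restriction(0.57, 2.0, 1.0), 2))
```

The corrected value is always at least as large in absolute size as the observed one when selection has narrowed the range, which is the direction of bias described in the text.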


DISCUSSION


The results of the present study are 
probably conservative. The distribu-
tions of scores for all criteria are at
least moderately skewed and all scales 
have relatively low reliability; more- 
over, the range of scores on the re- 
maining predictor variables, including 
background variables, is restricted. 
Therefore, the correlational results 
are probably underestimated. Since 
the students in the sample are all ex- 
ceptionally talented, the correlations 
between originality measures and 
scholastic aptitude are reduced. 
Whether or not the observed relation- 
ships can be generalized to more repre- 
sentative student populations is ques- 
tionable. 

The correspondences between the 
present study and related studies are 
numerous. The originality scales de- 
veloped by Barron (Independence of 
Judgment, Complexity-Simplicity, and 
Originality) and by Gough (DRS) 
have been frequently correlated with 
criteria of creative performance and 
our results are generally consistent 
with earlier reports (Barron, 1953a, 
1953b; Gough, 1957b). Although the 
relationships obtained in this study
are generally low, it is surprising that
the Barron and Gough scales work
at all, since they were constructed for
adult samples and [illegible] different cri-
teria of creative behavior.

The CAS has promising validity since it is the best single predictor of creative performance (r’s range from .23-.37 for the CAS against the Creative Arts and Science criteria for both sexes) and since its highest correlations are with a set of variables which lend support to this interpretation. For boys, the variables having the largest correlations with the CAS are, in descending order (.87-.15): Breadth of Interest, Self-Ratings of Originality and Independence, Emotionality, [illegible] Ratings of self-control, Total DRS Score, Complexity-Simplicity, level of degree sought, and Barron Originality. For girls, the largest correlates of the CAS are (.43-.16): Breadth of Interest, Self-Ratings of Originality and Independence, Barron Originality, Complexity-Simplicity, Sense of Destiny, Self-Evaluation, and mother’s education. The Fostering Dependency and Ascendency of Mother scales of the PARI correlate negatively with the CAS (-.16 and -.17, respectively).

The assumed validity of the CAS may be overestimated, since 4 of the 32 scale items involve activities which are preparatory to or associated with the accomplishments listed in the Creative Science and Arts Scales. The correlates of the CAS listed above imply some validity independent of the CAS’s correlations with the performance scales, however.

The CAS was suggested by the Chorness and Nottelmann study (1957) in which it was found that a set of extracurricular hobbies had significant correlations (.27-.45) with four or five Guilford factor composites when intelligence is partialed out. The item overlap, if any, between the Chorness creative hobbies and the CAS [illegible].

The [illegible] concurrent validity of the Self-Ratings is of special interest. [illegible] variables used, the Self-Ratings show most clearly that academic and creative performers conceive of themselves as different from [illegible]. It is of interest too that high Self-Ratings of Perseverance and Self-Confidence, two of the three [illegible] traits for Terman’s high and low achievers, are [illegible] here with both creative and academic [illegible] (Terman & Oden, 1947).

The findings for the Intellectuality and [illegible] scales of the VPI appear to replicate several studies ([illegible], unpublished; MacKinnon, 1959) in which vocational interests in [illegible] the arts were found to be associated with rated creative performance [illegible] high scores on tests of [illegible] behavior. Since the Intellectuality scale has its highest significant correlations with the Self-Sufficiency, [illegible], and Masculinity scales of the 16 PF, the present results are consistent with the characterization of the creative person as independent, asocial, and masculine. Similarly, the Emotionality scale of the VPI is correlated most highly with femininity, instability, and introversion (16 PF) (Holland, 1960b).

In general, the results seem to support many other hypotheses about the nature of the creative person, but an attempt to relate all of the specific findings to the voluminous theorizing about creativity would be a gargantuan task. Briefly, those hypotheses which regard the creative person as independent, intellectual, expressive, asocial, consciously original, and open to experience gain some support (Taylor, 1955, 1957, 1959). The creative student not only has an identifiable personal disposition, but also he
has relationships with teachers and 
parents which are in accordance with 
our expectations. His parents appear 
to be more permissive and more nur- 
turant of his ideas and impulses so 
that communication with the self and 
the world is stimulated. It is not sur- 
prising, then, that such students ap- 
pear to come in conflict with teachers 
who demand conforming, controlled, 
nonexploratory behavior. 

Perhaps the most unequivocal find- 
ing in the present study is that, for 
samples of students of superior scho- 
lastic aptitude, creative performance 
is generally unrelated to scholastic 
achievement and scholastic aptitude. 
Since the traditional predictors of 
scholastic aptitude are of little or no 
value for predicting creativity, it 
seems clear that scholarship programs, 
colleges, and other agencies, if they are 
concerned primarily with rewarding 
students or selecting employees who 
have potential for creative perform- 
ance, need to make a more active ef- 
fort to devise predictors of creative 
potential. In fact, attempts to build 
better scholastic aptitude tests may 
even be detrimental, since they may 
lead to a greater dependence on instru- 
ments which are of limited value and 
thus delay unnecessarily the develop- 
ment of efficient predictors of creative 
performance. 

The finding that teachers generally 
rate students with good grades higher 
than those with lower grades, together 
with the results of an earlier study 
(Holland, 1959b) which indicates that 
teachers give lower ratings to students 
with more potential for creativity (as 
measured by the 16 PF) than to stu- 
dents with less potential, suggests that 
school personnel value the good grade- 
getter more than the creative student. 
Recently, Jex and Merrill (1958) found that school
principals rate teachers with high
scores on an ingenuity test lower than
teachers with low test scores. Taken 
together, these results reinforce a com- 
mon belief that some of our traditional 
education may be stultifying rather 
than nurturant and fructifying. 


SUMMARY


The relationships between three 
criteria of academic and creative per- 
formance and 72 personal, demo- 
graphic, and parental variables were 
studied in a sample of talented adoles- 
cents. The results suggest that creative 
performance at the high school level 
occurs more frequently among stu- 
dents who are independent, intellec- 
tual, expressive, asocial, consciously 
original, and who have high aspira- 
tions for future achievement. Students 
who are persevering, sociable, respon- 
sible, and whose parents hold some- 
what authoritarian attitudes and val- 
ues, are more frequently academic 
achievers. The negligible relationships 
found between academic aptitude and 
creative performance at a high apti- 
tude level suggest that we need to use 
nonintellectual criteria in the selection 
of students for scholarships and fel- 
lowships. 


REFERENCES 
Barron, F. Inventory of personal philosophy. Berkeley, Calif.: Univer. California Press, 1952. (Institute of Personality Assessment and Research)

Barron, F. Complexity-simplicity as a personality dimension. J. abnorm. soc. Psychol., 1953, 48, 163-172. (a)

Barron, F. Some correlates of independence of judgment. J. Pers., 1953, 21, 287-297. (b)

Chorness, M. H., & Nottelmann, D. N. The prediction of creativity among Air Force civilian employees. USAF Personnel Train. Res. Cent. res. Rep., 1957, No. AFPTRC-TN57-36.

d’Heurle, Adma, Mellinger, Jeanne C., & Haggard, E. A. Personality, intellectual, and achievement patterns in gifted children. Psychol. Monogr., 1959, 73(13, Whole No. 483).

Drews, Elizabeth M., & Teahan, J. E. Parental attitudes and academic achievement. J. clin. Psychol., 1957, 13, 328-332.

Gough, H. G. What determines the academic achievement of high school students? J. educ. Res., 1953, 46, 321-331.

Gough, H. G. Imagination: Undeveloped resource. Paper read at Proceedings of the first conference on research and development in personnel management, Los Angeles, California, 1957. (a)

Gough, H. G. Researchers’ summary for the Differential Reaction Schedule. Univer. Calif., 1957. (Ditto) (b)

Holland, J. L. A personality inventory employing occupational titles. J. appl. Psychol., 1958, 42, 336-342.

Holland, J. L. The prediction of college grades from the California Psychological Inventory and the Scholastic Aptitude Test. J. educ. Psychol., 1959, 50, [pages illegible]. (a)

Holland, J. L. Some limitations of teacher ratings as predictors of creativity. J. educ. Psychol., 1959, 50, [pages illegible]. (b)

Holland, J. L. The prediction of college grades from personality and aptitude variables. J. educ. Psychol., 1960, 51, 245-254. (a)

Holland, J. L. The relation of the Vocational Preference Inventory to the Sixteen Personality Factor Questionnaire. J. appl. Psychol., 1960, 44, 29[illegible]. (b)

Jex, F. B., & Merrill, R. M. An evaluation of the first academic-year institute, University of Utah, 1957-58. U. [illegible] Monogr. Educ., 1958, 1, No. 2.

MacKinnon, D. W. What do we mean by talent and how do we test for it? In, The search for talent: College admissions 7. New York: College Entrance Examination Board, 1959.

Schaefer, E. S., & Bell, R. Q. Development of a Parental Attitude Research Instrument. Child Develpm., 1958, 29, 339-361.

Strodtbeck, F. L. Family interaction, values, and achievement. In D. C. McClelland, A. L. Baldwin, U. Bronfenbrenner, & F. L. Strodtbeck (Eds.), Talent and society. New York: Van Nostrand, 1958. Ch. 4.

Taylor, C. W. (Ed.) The first, second, and third University of Utah research conferences on the identification of creative scientific talent. Salt Lake City: Univer. Utah Press, 1955, 1957, 1959.

Terman, L. M., & Oden, Melita H. Genetic studies of genius: IV. The gifted child grows up. Stanford: Stanford Univer. Press, 1947.

Torrance, E. P. Explorations in creative thinking in the early school years: A progress report. In, The third University of Utah research conference on the identification of creative scientific talent. Salt Lake City: Univer. Utah Press, 1959. Pp. 58-71.


(Received September 26, 1960)


(Received September 26, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 3, 148-152


BEHAVIORS DURING PROBLEM SOLVING AMONG
CHILDREN OF LOW, AVERAGE, AND
HIGH INTELLIGENCE¹


HERBERT J. KLAUSMEIER
University of Wisconsin

AND

LEO J. LOUGHLIN
Northern Illinois University


The range of differences among school children in many intellectual and psychomotor abilities and other characteristics is defined clearly. Less clearly defined are the factors which contribute to the differences. Previously, no statistically significant difference was found among low, average, and high IQ children in retention of arithmetic problems graded to each child’s achievement level (Klausmeier & Feldhusen, [illegible]) nor in [illegible] of self-[illegible] in relation to expectancy (Klausmeier, Check, & Feldhusen, [illegible]). [illegible] differences [illegible] between IQ [illegible] achievements [illegible] groups in [illegible] persistence, and mode of attack while in the actual process of solving arithmetic problems.


PROCEDURE

Subjects

The subjects were 20 boys and 20 girls with WISC IQs of 55-80, 20 boys and 20 girls with WISC IQs of 90-110, and 20 boys and 20 girls with WISC IQs of 120-146. [illegible] the study were low IQ children [illegible] a second handicap to a marked [illegible] definite organic symptoms [illegible] such as mongolism. All had birthdays between September 15, 1947, and [illegible] 1948; and the mean age at the time of gathering the data was 131 months [illegible] group.

¹ The research reported herein was performed pursuant to a contract with the Office of Education, United States Department of Health, Education, and Welfare.


Methods 


The first task was to get problems properly graded in difficulty.
After [...] experimenting with pupils in regular classrooms and with
low IQ pupils not in the study, a group of 29 problems at three levels
of difficulty was arranged. The problems for the low IQ children were
divided further, with the lowest level intended for those who did not
know the number of cents in a dime or who could not perform simple
addition. A reserve level of problems of high difficulty was
established for the high IQ child who might solve the assigned problem
in less than 3 minutes. The least difficult problem designed for a low
IQ child was to make 7¢ with three coins, and the most difficult
designed for a high IQ child was to make $14.53 with 16 bills and
coins.

Besides this initial formulation of problems, two other measures were
used to ascertain that the difficulty of the problem was appropriate to
each child. A survey test was administered to ascertain his knowledge
of money, related vocabulary, and [...] skills. Also, if a child could
solve the [...] problem in less than 3 minutes, the next problem in
order of difficulty was assigned.

Each child solved only one problem. This problem and the survey test
mentioned previously were administered by a carefully trained examiner,
and another trained person observed the entire procedure. From the time
the problem instructions were [...] the child [...] problem was solved,
[...] the incidence of certain behaviors [...] child were recorded:
(a) [...] of coins by the [...] and manipulation of figures
with pencil and paper by the average and
high IQ children; (b) engaging in nontask
activities or withdrawal which resulted in the
examiner's encouraging the child to continue
his problem solving; (c) offering an
incorrect solution; (d) noting and correcting
mistakes independently; (e) verifying
the solution by checking the amount of
money independently; (f) verifying the
solution by checking the number of coins
independently; and (g) using a random,
logical, or a combination approach to the
solution. In random approach the subject
merely manipulated or explored the materials
without an apparent plan of attack. In
logical approach the subject made deliberate
manipulations to test successive trials or
hypotheses. Some children changed from an
apparently random to a logical approach,
and vice versa.

Efficiency of the child's method was defined
as the ratio of moves (coins or paper-pencil
computations made) to the least
number needed to achieve the solution. For
example, the least efficient subject made 39
moves when only 2 were required, with a
resulting efficiency ratio of .051. Any instance
of a child's being encouraged by the examiner
to cease from a nontask activity or
to continue at the task was recorded as the
measure of nonpersistence. The mode of
attack was observed in the verbal responses
of the subject and in his manipulations of
coins or figures, as described in the preceding
paragraph, c through f.
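
The efficiency measure defined above can be sketched in a few lines. This is only an illustration: the function name is hypothetical, and it assumes that the ratio is the least number of moves needed divided by the number of moves made, which is the reading implied by the worked example (2 needed, 39 made, ratio .051).

```python
def efficiency_ratio(moves_made: int, least_moves_needed: int) -> float:
    """Least number of moves needed divided by moves actually made.

    Hypothetical helper: the division order is inferred from the
    worked example in the text (2 needed / 39 made = .051).
    """
    if moves_made <= 0 or least_moves_needed <= 0:
        raise ValueError("both counts must be positive")
    return least_moves_needed / moves_made


# The least efficient subject reported in the text.
print(f"{efficiency_ratio(moves_made=39, least_moves_needed=2):.3f}")
```

Under this reading a ratio of 1.0 means that no unnecessary moves were made, and smaller values mean less efficient work.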

The data secured on each subject were tabulated and combined according
to the groups. Analysis of variance was used to compare efficiency of
method and chi square to compare incidences of the other behaviors
among the three groups.


RESULTS


The extent to which the problems were of equal difficulty for each IQ
group was ascertained by comparing the time in seconds required to
reach a solution to the problem. The low IQ group required a mean time
of 624.17 seconds, the average group 627.65, and the high group 578.47
seconds. On the basis of a small and insignificant F value obtained
among the means, it was concluded that the assigned problems were about
equally difficult for the three IQ groups.

A summary of the data and the results of the tests of statistical
significance of differences among the three IQ groups and between
groups are presented in Table 1. Column 1 lists the observed behaviors.
Columns 2-4 contain the scores or number of subjects in each IQ group
who demonstrated the behavior. Column 5 shows the results of the tests
of statistical significance of differences among the three IQ groups.
Columns 6-8 give the results of the tests of significance of
differences between groups: high and average, high and low, and average
and low.


TABLE 1
SUMMARY OF OBSERVED SCORES AND RESULTS OF TESTS OF STATISTICAL
SIGNIFICANCE OF DIFFERENCES

                                   Score or Incidence      Significance of Differences
                                                                          Between Groups
Behavior                           High   Average  Low     Among Groups     Hi-Av            Hi-Low           Av-Low

Efficiency                         .353   .309     .237    .05 > p > .01    ns*              .05 > p > .01    ns*
Nonpersistence (incidence)         3      10       17      .01 > p > .001   .02 > p > .01    .01 > p > .001   .02 > p > .01
Mode of attack (incidence)
  Offered incorrect solution       10     16       33      .001 > p         ...              ...              ...
  Noted and corrected mistakes     ...    ...      ...     ...              ...              ...              ...
  Verified solution: amt. money    ...    ...      ...     ...              ...              ...              ...
  Verified solution: no. coins     36     25       9       .001 > p         ...              ...              ...
  Random approach                  1      8        ...     .001 > p         .01 > p > .001   ...              ...
  Logical approach                 ...    5        3       .01 > p > .001   .01 > p > .001   .01 > p > .001   ...

* Not significant at .05 level.
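
As a rough illustration of the chi square comparison of incidences among the three groups, the sketch below computes a Pearson chi square for one behavior, offering an incorrect solution (10 high, 16 average, and 33 low IQ children). It rests on assumptions not stated in this form in the article: that each group contained 40 children, that an incidence counts children rather than events, and that an uncorrected Pearson statistic was used. The function name is hypothetical.

```python
def chi_square_incidence(incidence, group_size):
    """Pearson chi square for a groups x (shown / not shown) table.

    Assumes equal group sizes; returns (statistic, degrees of freedom).
    """
    k = len(incidence)
    total_shown = sum(incidence)
    total_n = k * group_size
    expected_shown = group_size * total_shown / total_n
    expected_not = group_size - expected_shown
    stat = 0.0
    for shown in incidence:
        stat += (shown - expected_shown) ** 2 / expected_shown
        stat += ((group_size - shown) - expected_not) ** 2 / expected_not
    return stat, k - 1


# Incidence of offering an incorrect solution: high, average, low IQ.
stat, df = chi_square_incidence([10, 16, 33], group_size=40)

# Tabled critical value of chi square for df = 2 at the .001 level.
CRITICAL_001_DF2 = 13.816
print(round(stat, 2), df, stat > CRITICAL_001_DF2)
```

Under these assumptions the statistic exceeds the .001 critical value, which agrees with the ".001 > p" entry for this behavior among groups.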

Table 1 shows that the low IQ group was inferior to the high IQ and to
the average IQ group in seven and six of the observed behaviors,
respectively; the average IQ group was inferior to the high IQ in six
behaviors. Although the mean number of seconds to solve was nearly
identical for the three groups, the incidences of behaviors were
greatly different. To clarify the differences, short sketches of the
procedures used by a high and a low IQ boy are now given.


Richard had a WISC IQ of 138 and used 585 seconds to solve the problem:
“Make $9.77, using 12 bills and coins.” He studied the situation and
during the first 3 minutes made a first attempt at solution, getting
the amount correct but using 13 instead of 12 bills and coins: 9 one
dollar bills, 1 half dollar, 1 quarter, and 2 pennies. He checked the
amount and number of coins and started over. During the next 3 minutes
he made his second and third tries, each time starting with a five
dollar bill. On the second he had 12 bills and coins but an incorrect
amount; on the third try, neither was correct. Each time he checked
amount and number of coins. During the third 3 minutes he made his
fourth try, but made a mistake in adding the amount. Checking through
his solution at the beginning of the fourth 3-minute period, he found
the error and after 45 seconds offered as a correct solution 1 five
dollar bill, 4 one dollar bills, 1 half dollar, 1 dime, 3 nickels, and
2 pennies. Richard made 25 separate moves, but only 6 were needed, as
shown in the final solution. Therefore, his efficiency ratio was 6/25,
or .24.

In terms of a problem solving sequence set forth by Merrifield,
Guilford, Christensen, and Frick (1960), Richard (a) accepted the task
as a problem and set a goal of reaching a correct solution, (b)
analyzed available information in relation to his goal, (c) produced a
solution and independently rejected it (verification), (d) applied
knowledge gained from this and subsequent solutions to each successive
solution produced (reapplication), and (e) finally verified and
accepted the correct one.
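
The two checks Richard carried out, amount of money and number of bills and coins, can be sketched as follows. The dictionary of denominations and the function name are illustrative assumptions; the two solutions tested are the first attempt and the final solution reported in the sketch above.

```python
# Values in cents for the denominations named in the text (illustrative).
VALUE_IN_CENTS = {
    "five dollar bill": 500,
    "one dollar bill": 100,
    "half dollar": 50,
    "quarter": 25,
    "dime": 10,
    "nickel": 5,
    "penny": 1,
}


def check_solution(pieces, target_cents, target_count):
    """Return (amount_correct, count_correct) for a proposed solution.

    `pieces` maps a denomination name to how many of it were used.
    """
    amount = sum(VALUE_IN_CENTS[name] * n for name, n in pieces.items())
    count = sum(pieces.values())
    return amount == target_cents, count == target_count


# Problem: "Make $9.77, using 12 bills and coins."
first_try = {"one dollar bill": 9, "half dollar": 1, "quarter": 1, "penny": 2}
final_try = {
    "five dollar bill": 1,
    "one dollar bill": 4,
    "half dollar": 1,
    "dime": 1,
    "nickel": 3,
    "penny": 2,
}

print(check_solution(first_try, 977, 12))  # amount right, count wrong
print(check_solution(final_try, 977, 12))  # both right
```

Keeping the two checks separate mirrors the two verification behaviors recorded by the observers, since a child could perform one without the other.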
Michael, one of the more [...] low IQ boys, had a WISC IQ of 73 and
used 805 seconds to “Make 46¢, using 4 coins.” Ten coins of each
denomination (penny, nickel, dime, and quarter) had been placed on the
table before him. (In a pretest it had been established that he had the
[...] skill in adding and knowledge of [...] to solve the problem.)
Michael, like Richard, restated the problem correctly prior to
attempting a solution. During the first 3 minutes, Michael took 3 dimes
and 10 pennies from the supply, then returned to the supply [...]
pennies, one at a time, and offered the remainder as a correct
solution. Upon being told that it was incorrect and with all coins
returned to the supply, he studied the coins and offered 1 quarter and
2 dimes as a second solution during the second 3 minute period. [...]
supply 1 quarter, [...], and 1 penny and offered [...] as correct. When
told it was [...] the coins had been returned [...] studied the coins,
and during the fourth 3 minutes withdrew [...] quarter, [...] penny, 1
dime, then put the quarter back. During the fifth [...] minutes [...]
again withdrew the quarter [...] another dime from the supply, thus
having 1 quarter, 2 dimes, and 1 penny [...] he offered as correct
[...] observably checking accuracy of amount or number of coins. [...]
a total of 16 coin movements [...] only 3 were needed; his efficiency
ratio was .188. At no point [...] to encourage Michael [...] to keep at
the task; however, like the majority of low IQ children he offered
incorrect solutions and did not verify amount of money or number of
coins.

In what respects are the low IQ children different from the high IQ
children? As noted in the above descriptions, the high IQ boy was
markedly superior in verification and reapplication, eventually
arriving at the correct solution [...]. Further, other low IQ children,
[...] from Michael, did not persist when informed that their solutions
were wrong, and after two or three such experiences, manipulated [...]
without producing a close approximation of the final correct solution.

Discussion 


Had time to reach a correct solution been used as the criterion of
efficiency of the problem solving, no significant difference would have
been observed among the three groups. Using the ratio of efficiency as
the criterion, a significant difference was observed. The differences
of most practical significance among the groups were connected with
producing possible solutions, verifying each proposed solution, and
applying information from each preceding trial to the subsequent one.
As John (1957), Duncan (1959), and others have pointed out, processes
such as these used in arriving at solutions are the important variables
in understanding efficiency of performance and learning.

Whether more of the low IQ children in this study could learn seemingly
simple processes as deciding a plan for making 13¢ with four coins is
not known. Similarly, verifying the amount of money and number of coins
does not appear to be difficult, although the high IQ children also
tended to do only one or the other on the first try. If these processes
can be learned, more specific instruction in school devoted to helping
children acquire processes, rather than solutions, would be helpful.

Though not studied intensively, differences within each IQ group were
large. For example, the ratio of efficiency varied from .10 to .80
among the high IQ children. Further, 10 high and 16 average IQ children
offered incorrect solutions as shown in Table 1. Help given to children
of average and high abilities when needed might be especially
profitable in achieving improved problem solving performances.


SUMMARY


Differences among groups of children of low, average, and high IQ were
found in observed behaviors as they solved problems, graded in
difficulty to their present achievement levels. The high IQ children
showed a greater incidence than the average and the low, and the
average a greater incidence than the low in noting and correcting
mistakes independently, verifying solutions, and using a logical
approach; whereas the low IQ showed a greater incidence than the
average and the high, and the average a greater incidence than the high
in nonpersistence, offering an incorrect solution, and using a random
approach. The high IQ children were superior to the low in efficiency
of method, as ascertained by the ratio of number of moves made to the
least number required for solution of the problem. Differences in
performances among individuals within IQ groups were also large.


REFERENCES 


Duncan, C. P. Recent research on human problem solving. Psychol.
Bull., 1959, 56, 397-429.

John, E. R. Contributions to the study of the problem-solving process.
Psychol. Monogr., 1957, 71(18, Whole No. [...]).

Klausmeier, H. J., Check, J., & Feldhusen, J. [...] intelligence at 125
months of age. Amer. J. ment. Defic., 1960, 65, 69-78.

Klausmeier, H. J., & Feldhusen, J. Retention in arithmetic among
children of low, average, and high intelligence at 117 months of age.
J. educ. Psychol., 1959, 50, 88-92.

Merrifield, P. R., Guilford, J. P., Christensen, P. R., & Frick, J. W.
A factor-analytic study of problem-solving abilities. Rep. Psychol.
Lab., U. Southern Calif., 1960, No. 22.


(Received October 3, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 3, 153-156


THE RELATION BETWEEN SEVERAL PARENT 
MEASURES AND THE CHILD’S EARLY 
ADJUSTMENT TO SCHOOL¹


GENE R. MEDINNUS 
San Jose State College 


The present paper reports data dealing with first grade readiness and
adjustment in first grade. The purpose of this study was to explore the
relationship between parental attitudes toward a number of aspects of
child rearing, as assessed through questionnaires [...] to both parents
and through interviews with the mother, and the child's general
adjustment in Grade 1.


METHOD


Subjects 


The subjects in the investigation were a group of first graders and
their parents. Two groups of children were identified through the
teacher's rating of their adjustment in first grade on a five-point
scale, with 5 representing good adjustment, 3 typical of the adjustment
of the “average” child, and 1 indicating poor adjustment. The rating
was made on the basis of and following the teacher's rating of the
children on a 52-item First Grade Adjustment Scale. Those 10 children
receiving ratings of 4 and 5 were treated as one group; the other group
was comprised of those 9 with ratings of 1 and 2. The mean IQ of the
children was 111.6 according to the Stanford-Binet (Form L)
intelligence tests administered by the investigator. The socioeconomic
status of the families was approximately evenly divided between the
upper-lower and lower-middle classes according to Warner's Index of
Status Characteristics (Warner, Meeker, & Eells, 1949). At the
beginning of the study the mean ages of the mothers and fathers were
32.1 and 34.1 years respectively, and the corresponding mean years of
education were 12.0 and 12.2.

¹ This study was completed under the sponsorship of the Elizabeth
McCormick Memorial Fund of Chicago.


Scales 


The two instruments employed to assess 
parental attitudes and behavior will be de- 
scribed first. Data from these two measures 
were obtained during the year prior to the 
child’s entrance into first grade. 

The parents were asked to complete a
form of the Parent Attitude Research Instrument
(PARI) developed at the National
Institute for Mental Health. The development
of the scale, its uses and reliability
coefficients have been described in some
detail by Schaefer and Bell (1955, 1958).
Briefly, the scale is a Likert-type question- 
naire consisting of a number of subscales 
each concerned with various aspects of 
family life, child rearing attitudes, and re- 
lationships. The particular forms of the 
scale employed consisted of 29 subscales 
for the mothers and 25 for the fathers. The 
responses to the items were scored either 4, 
3, 2, or 1, depending upon whether the re- 
sponse was “Strongly agree,” “Somewhat 
agree,” “Somewhat disagree,” or “Strongly 
disagree,” respectively. The questionnaires 
were left with the parents following an 
evening home visit by the investigator. 
They were returned as they were completed. 
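
The item scoring just described can be sketched as below. The 4-to-1 coding is taken from the text; summing item scores to obtain a subscale score is an assumption made here for illustration only, and the function name and sample responses are hypothetical.

```python
# Item coding as described in the text.
ITEM_SCORE = {
    "Strongly agree": 4,
    "Somewhat agree": 3,
    "Somewhat disagree": 2,
    "Strongly disagree": 1,
}


def subscale_score(responses):
    """Sum of item scores for one subscale.

    Summation is an illustrative assumption; the text states only how
    individual items were coded.
    """
    return sum(ITEM_SCORE[response] for response in responses)


# Hypothetical responses of one parent to a five-item subscale.
example = [
    "Strongly agree",
    "Somewhat agree",
    "Somewhat disagree",
    "Strongly disagree",
    "Strongly agree",
]
print(subscale_score(example))
```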

The Fels Parent Behavior Rating Scales originally were designed in an
attempt to identify areas of the home environment which might be
expected to bear some causal relation to the development of the child's
personality (Baldwin, Kalhorn, & Breese, 1949; Champney, 1941). Lorr
and Jenkins (1953) further analyzed the intercorrelations of the Fels
scales presented by Roff (1949) in an attempt to determine second-order
factors of a broad character which would be useful for conceptualizing
the parent-child relationship. Three factors emerged; for convenience
in the present discussion these will be labeled as follows: (a)
Dependence vs. Independence Encouraging, (b) Democratic vs.
Authoritarian, (c) Degree of organization in the home. Those five
scales having the highest loadings on each of these three factors were
employed in the present study. The ratings were made by the
investigator on the basis of home visits and interviews with the
mothers. Interrater reliability was established through previous
ratings of 10 homes by the investigator and a graduate assistant.² The
mean reliability of these ratings for the 15 scales was .80. A list of
the 15 rating scales is provided in Table 1.

For the purpose of assessing the subjects' adjustment to the first
grade situation, a First Grade Adjustment Scale was developed
(Medinnus, in press). Briefly, the following procedure was employed in
the construction of the scale. Twenty-five first grade teachers were
interviewed concerning factors related to a child's adjustment in Grade
1. The teachers were asked to mention specific traits, abilities, and
items of behavior which, in their opinions, characterize or describe
good adjustment and poor adjustment in the first grade. The
characteristics listed for good and poor adjustment were examined for
parallel items which seemed to define opposite ends of a single
continuum. Those items for which there was at least a minimum level of
consensus were placed on a five-point scale, with 3 representing the
midpoint or rating of the “average” child, and 1 and 5 describing poor
and good adjustment, respectively. The 52 items were grouped into five
major areas: Physical Status, Social Behavior, Emotional Behavior,
Intellectual Abilities and Behavior, and Adjustment to Classroom
Membership and Requirements. Interrater reliability coefficients for
the five sections and for the full scale clustered about .75.


RESULTS 


Those PARI scales which differentiated significantly between the
parents of the two groups of children will be mentioned first. The
fathers and mothers were compared separately since the subscales and
the items were not identical for the two forms. The following scales
differentiated between the mothers of the well- and poorly adjusted
first graders at the .05 level or beyond: Approval of Activity,
Irritability, Dependency of Mother, and Fear of Harming Baby. Mothers
of the well adjusted children showed more of the attributes measured by
the first three scales than did the mothers of the poorly adjusted
children; the reverse was true for the fourth scale mentioned. No scale
reached this significance level for the fathers. [...] for the Fels
scales are given in Table 1.

² The writer is grateful to Floyd Ayers for assistance with ratings.


Discussion 


If there is any validity to the premise that the child's behavior and
personality are in part causally determined by parental attitudes one
might expect wide differences between the attitudes of parents of well-
and poorly adjusted first grade children. That such was not revealed in
the present analysis is apparent from the fact that only four scales
differentiated between the two groups of mothers and none of the scales
reached the significance level for the fathers. One possible
interpretation of this finding emphasizes the lack of validity in the
parent attitude scale employed. Another interpretation assumes that
there is actually no relation between parental attitude and the child's
general adjustment in the school situation. The findings with [...]
Fels scales seem to discount this [...] interpretation.

It is apparent from Table 1 that [...] 5 of the 15 Fels scales
differentiated between the home environments of the two groups of
children. Three of the 5 were included in the Dependence vs.
Independence Encouraging factor. In addition, the parents of the well
adjusted group were rated [...] Solicitousness variable [...]
difference [...] statistically [...] the homes [...] rated higher in
[...] dependence [...] direction. [...] seem [...] run counter to the
notion of the need for independent behavior in the school setting, the
findings obtained here concerning the three scales, General babying,
Child centeredness of the home, and Protectiveness, are in accord with
the findings of other investigations (Baldwin et al., 1949; Bandura &
Walters, 1959). In their interpretation of the parent behavior ratings,
the Fels researchers identified an “indulgence” pattern consisting of
three variables, Babying, Protectiveness, and Solicitousness. High
ratings on these variables were interpreted as indicating either one of
two processes in the home: “(a) the mother is warmly doting and
protective of the child; or (b) she is anxious and restrictive lest the
child endanger himself or discommode her” (Baldwin et al., 1949, p. 8).
On the basis of the relationship of these three variables to other
rating scales, these investigators concluded that “there is a definite
tendency for those parents who are rated high on indulgence to be warm,
emotional, and unable or unwilling to give the child either emotional
or physical freedom” (p. 8).


TABLE 1
RATINGS OF HOMES OF TWO ADJUSTMENT GROUPSᵃ

                                        Well Adjusted    Poorly Adjusted
Fels scales                             M       σ        M       σ         p

A. Dependence vs. Independence
   Encouraging
     General babying                    51.3    7.91     42.8    7.81      .05
     Child centeredness of home         55.3    7.85     42.1    6.28      .01
     Solicitousness for welfare         51.0    7.82     45.1    11.69     ns
     General protectiveness             52.7    6.42     40.9    8.48      .01
     Intensity of contact with [...]    50.6    8.93     49.0    10.99     ns

B. Authoritarian vs. Democratic
     Democracy of policy                46.4    8.14     52.6    12.87     ns
     Coerciveness of suggestion         47.6    6.92     48.7    12.42     ns
     Justification of policy            44.8    6.22     56.3    10.07     .01
     Restrictiveness of regulations     47.9    8.31     47.3    18.91     ns
     Emotionality toward child          47.5    8.06     53.2    12.93     ns

C. Organization and Effectiveness of
   Control
     Readiness of enforcement           48.8    8.15     50.1    18.37     ns
     Severity of actual penalties       47.2    6.57     52.8    12.97     ns
     Accelerational attempt             55.7    7.86     40.7    5.48      .001
     Coordination of household          49.0    7.90     45.4    10.73     ns
     Activeness of home                 47.6    8.18     54.9    11.78     ns

ᵃ The raw score ratings were converted to McCall Ts.


While two possibilities were described to explain high ratings on the
indulgence variables, the burden of the present discussion is to argue
that, in the present study at least, the higher [...] the homes of the
well adjusted [...] on these variables indicate acceptance of the child
while the lower ratings of the homes of the poorly adjusted children
point to rejection of the child. Great concern with encouraging
independence may actually reflect parental rejection of the child.
Parental declarations that the child must learn to “stand on his own
two feet” and “fight his own battles” may be attempts to justify
nonnurturant and nonsupporting behavior growing out of feelings of
rejection.

This interpretation is similar to that of Bandura and Walters (1959)
who found that the parents of aggressive boys tended to discourage
dependency behavior in their sons and that parental rejection was
related to nonnurturant parental behavior. The [...] “Indeed, parental
re[...] a pervasive [...] of the child's at[...] support from his
parents” (p. 68).


SUMMARY


The present paper presented data concerning the relationship between
[...] attitudes and be[...] child's adjustment in [...]. A comparison
was made of the parent attitude scores and Fels home ratings of two
groups of children, 10 well adjusted first graders and 9 poorly
adjusted first graders.

The following results emerged from the study:

1. Four of the 29 PARI subscales differentiated significantly between
the mothers of the [...] groups. None of the scales reached the .05
level for the fathers.

2. Five of the 15 Fels scales differentiated between the home
environments of the two groups of children. Three of the 5 scales were
included in the Dependence vs. Independence Encouraging factor.

The results were interpreted as indicating that the lower ratings of
the homes of the poorly adjusted children on the several dependence
encouraging variables reflected parental rejection which was a causal
factor in the children's poor adjustment to the demands of the first
grade situation.


REFERENCES 


Baldwin, A. L., Kalhorn, Joan, & Breese, Fay H. The appraisal of
parent behavior. Psychol. Monogr., 1949, 63(4, Whole No. 299).

Bandura, A., & Walters, R. H. Adolescent aggression. New York: Ronald,
1959.

Champney, H. The measurement of parent behavior. Child Develpm., 1941,
12, 131-166.

Lorr, M., & Jenkins, R. L. Three factors in parent behavior. J.
consult. Psychol., 1953, 17, 306-308.

Medinnus, G. R. The development of a first grade adjustment scale. J.
exp. Educ., in press.

Roff, M. A factorial study of the Fels Parent Behavior Scales. Child
Develpm., 1949, 20, 29-45.

Schaefer, E. S., & Bell, R. Q. Parental Attitude Research Instrument
(PARI): [...] data. Unpublished [...], National Institutes of Health,
1955.

Schaefer, E. S., & Bell, R. Q. Development of a Parental Attitude
Research Instrument. Child Develpm., 1958, 29, 339-361.

Warner, W. L., Meeker, Marchia, & Eells, K. Social class in America.
Chicago: Science Research Associates, 1949.


(Received October 3, 1960)


Journal of Educational Psychology
1961, Vol. 52, No. 3, 157-160


ON THE USE OF A SHORT VOCABULARY TEST TO 
MEASURE GENERAL INTELLIGENCE 
JOHN B. MINER 


University of Oregon 


An earlier publication (Miner, 1957) summarized the results of [...]
studies which yielded estimates of the relationship between vocabulary
tests and measures of what [...] to call general intelligence. [...] of
these studies utilized samples characterized by a rather wide range of
ability and employed vocabulary measures of intermediate length such as
those contained in the Wechsler or Stanford-Binet tests. The [...]
measures were the 1916 Stanford-Binet, the 1937 Stanford-Binet, the
Wechsler-Bellevue scales, the Wechsler Intelligence Scale for Children,
the General Classification Tests developed by the Armed Services, and
the [...] tests. The median correlation [...] was .83. On the basis of
this [...], plus the knowledge that the [...] general measures
themselves do not intercorrelate at a much higher level, it was assumed
that general intelligence can be adequately predicted from a vocabulary
test.

The studies of intelligence differences between various population
subgroups and of manpower utilization [...] the educational and
occupational systems of this country which are presented in
Intelligence in the United States (Miner, 1957) are based on findings
obtained with a short vocabulary test. The particular test used [...]
in three separate investigations (Hagen & Thorndike, 1955; Thorndike,
1942; Thorndike & Gallup, 1944) to have a reliability in the .80 to .85
range. In view of this evidence of reliability and existing knowledge
as regards the relationship between vocabulary and general
intelligence, the assumption that an adequate measure of group
differences in intelligence was being employed seemed justified.
This conclusion has, however, been 
criticized on several grounds, perhaps 
with some justification. The short vo- 
cabulary test contained only 20 items 
whereas the Wechsler and Stanford- 
Binet measures are considerably 
longer. Furthermore, the short vocabulary
test used a multiple-choice format
rather than a free response procedure.
A question was raised as to
whether the specific vocabulary test 
used in the studies presented in Intel- 
ligence in the United States was in 
fact as valid a measure of intelligence 
as the presumptive evidence would 
suggest. No information on the cor- 
relation between the short vocabulary 
test and a more general measure was 
presented in the book, nor was such 
information available to the author at 
the time the book was published. 
The present work was carried out 
with a view to remedying this de- 
ficiency and casting further light on 
the validity of the conclusions reached 
in Intelligence in the United States. 


PROCEDURE AND RESULTS 


In connection with a study designed 
to investigate sales effectiveness, a 
number of tests were administered to 
108 members of the sales organization 
of a large corporation. The tests em- 
ployed included Forms A and B of 
the short vocabulary test originally 
developed by E. L. Thorndike for the 
IER Intelligence Scale CAVD, the 


157 


158 JOHN B. 


Concept Mastery Test (Terman & 
Oden, 1959) devised to measure the 
gifted group in adulthood, and the 
verbal subtests (excluding Vocabu- 
lary) of the Wechsler Adult Intelli- 
gence Scale. The sample was preselec- 
ted on intellectual ability since the 
company characteristically uses tests 


in the Screening of applicants. All of 
the men had h 


tion at the 


of salesman, However, 


agement level when tested. None had 
a Wechsler Verbal IQ below 100, 
The intercorrelations are presented 
in Table 1. Forms A and B of the 
short vocabulary test have been com- 
bined to provide an index of th 
tiveness of a longer (40 item) 


MINER 


evidence for selection on intelligence 
or its correlates is marked. On Form 
A, for which we have definite informa- 
tion, the mean score for the population 
as a whole (excluding those aged nine 
and below) is 10.77. The standard de- 
viation is 3.46. 

The difference in means between 
Forms A and B is reliable (t = 5.50, 
Pp < 01). This finding has been 
checked on a sample of 64 pee 
machine operators who also took bo 
forms of the test. In this latter “en 
the means were 11.59 and 13.45, al 
spectively, (t = 4.95, p < .01). en 
dike reports that Form A appears i: 
be somewhat casier at the lower pa 
and harder at the higher levels t a 
Form B. The results presented 
are consistent with such a conclusio . 

Tn view of the known restriction ° 
Tange in the present sample, it seeme 


vocabu- advisable to esti ize of the 
cS estimate the size ‘ 
— Por Form Phee employed in Correlations that might be expected 1? 
tudi The W Ts cee States © population as a whole. Such figures 
rescerat . ies Verbal Score was are more suitable for evaluating the 
p y 1binIng scaled Scores validit; f the Intelligence in the 
for the Information, Comprehension, Writer Pad he Inte ie! h utilized 
ri J imilariti aes ‘ c 
eee eh ats Digit a Sample Sica of the br 3 
: ne Population, ore suit- 
oe and 3nlOFMation able for eee an the 
viations for each of the Pe ge findings obtained the studies em- 
*  Ploying longer vocabulary _ tests) 
‘ABL 
OBTAINED Correvations FOR a bate Gnows 
N= 108) OF Sates Empioynes 
Rest us SD Vocabulary Test 
‘AIS 
Form 4 Form B seaneent Verbal 
Vocabulary Test 
Forms A & B (40| 97.64 3.77 SS 
items) -89 .89 56 
ary Test ; 13.03 2.11 73 P 
Form A (20 items .59 
Vocabulary Test 14.61 2.11 64 AT 
Form B (20 items. 
Coneept Mastery Test 61.50 26.60 -67 BA 
WAIS Verbal 67.22 6.34 we 


A SHORT VOCABULARY TEST OF INTELLIGENCE 


a characteristically were based 
mien of hospital patients, 
Sah ares service inductees, un- 
Ss i " 
iopulatie ae children, or the adult 
population. The estimates in Table 2 which
involve the Concept Mastery Test and
WAIS were computed using the following
formula taken from Gulliksen (1950):

$$R_{xy} = \sqrt{1 - \frac{s_y^2}{S_y^2}\left(1 - r_{xy}^2\right)}$$
In this formula, the standard deviation
of the population scores for both
forms of the vocabulary test (S_y) was
taken as 3.46. In all probability the
resulting estimates, which average in
the high .80s and tend generally to
exceed by a small amount the estimate
of reliability in the population, are
somewhat inflated. Apparently the
standard deviations obtained in the
selected groups for the two forms of
the vocabulary test (s_y) are underestimates,
due to ceiling effects. Inspection
of the distributions suggests,
however, that the error introduced in
this way is not sizable. It seems
almost certain that the population
figures would not drop below .75.
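
The correction described above can be sketched numerically. The sketch assumes the formula has the form R = sqrt(1 - (s^2 / S^2)(1 - r^2)), which rests on the error of estimate being the same in the selected group and in the population. The function and argument names are hypothetical. The standard deviations 2.11 and 3.46 are the values quoted in the text; the obtained correlation of .64 is used only as an illustrative input.

```python
import math


def correct_for_restriction(r, s_selected, s_population):
    """Estimate a population correlation from one obtained in a selected group.

    Assumes the error of estimate is equal in both groups, so that
    R = sqrt(1 - (s^2 / S^2) * (1 - r^2)).
    """
    variance_ratio = (s_selected / s_population) ** 2
    return math.sqrt(1.0 - variance_ratio * (1.0 - r ** 2))


# Illustrative inputs only: r = .64, s = 2.11 (selected group), S = 3.46 (population).
estimate = correct_for_restriction(0.64, 2.11, 3.46)
print(round(estimate, 2))  # prints 0.88
```

Because the ratio s/S enters squared, an underestimate of the selected-group standard deviation (for example, from ceiling effects) pushes the estimate upward, which is the inflation the paragraph above cautions about.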
The correlation in Table 2 between
Form A and Form B is an estimate of
reliability in the population as a whole
and was derived using Gulliksen's
Equation 5 (1950, p. 111). This assumes
that the two are parallel forms,
which is, apparently, not strictly
speaking, the case. Nevertheless, the
figure is consistent with those previously
reported. A similar estimate was
derived from the tabulating machine
operator sample. The corrected reliability
coefficient there was .82. The
estimates involving Form B are based
on the assumption that the standard
deviation in the population is the
same as for Form A. This assumption
in all probability yields excessively
conservative estimates of correlation



TABLE 2
ESTIMATED CORRELATIONS BETWEEN TESTS
IN THE GENERAL POPULATION

                                  | Vocabulary Test | Concept      | WAIS
                                  | Form B          | Mastery Test | Verbal
Vocabulary Test Form A (20 items) | .85             | .88          | .84
Vocabulary Test Form B (20 items) |                 | .89          | .86


size. Lorge (1957) reports a standard 
deviation of 3.4 for Form A and 4.0 
for Form B among United States 
Army enlisted men. 
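
The reliability estimate discussed above can be illustrated in the same spirit. The sketch assumes the correction takes the form R = 1 - (s^2 / S^2)(1 - r), that is, that the error variance is the same in the selected group and in the population. The function name and the input values are illustrative assumptions rather than figures confirmed by the article.

```python
def correct_reliability(r, s_selected, s_population):
    """Estimate population reliability from reliability in a selected group.

    Assumes equal error variance in both groups, so that
    R = 1 - (s^2 / S^2) * (1 - r).
    """
    variance_ratio = (s_selected / s_population) ** 2
    return 1.0 - variance_ratio * (1.0 - r)


# Illustrative inputs only: r = .59, s = 2.11, S = 3.46.
print(round(correct_reliability(0.59, 2.11, 3.46), 2))  # prints 0.85

# A larger assumed population standard deviation gives a higher estimate,
# which is why assuming too small a value is conservative.
print(correct_reliability(0.59, 2.11, 4.0) > correct_reliability(0.59, 2.11, 3.46))  # prints True
```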

The above reference also contains 
information on correlations between 
the short vocabulary tests and the 
Army General Classification Test for 
two representative samples of enlisted 
men. The samples numbered 868 for 
Form A and 883 for Form B. The co- 
efficients were .70 and .77, respectively. 


CONCLUSION


Taken together the Lorge data and 
the present findings offer rather im- 
pressive support to those who have 
argued for the validity of short vocab- 
ulary tests as measures of intelligence. 
The 20-item tests yield correlations 
with the more general measures of in- 
telligence which are comparable to 
those found between the general meas- 
ures themselves. In the population as 
a whole the correlation between the 
short tests and tests of general intelligence
appears to be at least .75. This
figure compares favorably with that
of .83 found for the longer and presumably
somewhat more reliable vocabulary
tests on which data have
been reported previously.

It is hoped that the publication of 
these findings will result in a greater 
use of short vocabulary tests in con- 
nection with research studies where 




practical considerations have hereto- 
fore precluded the investigation of 
relationships between experimental 
variables and intelligence. Prior as- 
sumptions regarding the impossibility 
of achieving adequate reliability and 
validity in such short measures were 
apparently incorrect. There would 
seem to be little basis for criticizing 
any study on the score of inadequate 
measurement techniques, purely be- 
cause a short verbal measure of in- 
telligence was employed. 


REFERENCES 


Gulliksen, H. Theory of mental tests. New
York: Wiley, 1950.


JOHN B. MINER 


Hagen, Elizabeth P., & Thorndike, R. L.
Normative test data for adult males
obtained by house-to-house testing. J.
educ. Psychol., 1955, 46, 207-216.

Lorce, I. Vocabulary Test G-T: hie 
and norms. New Sages Columbia 

i 957. (Mimeo ‘ 

Miner, J. B. Intelligence in the United
States. New York: Springer, 1957.

Terman, L. M., & Oden, Melita H. The
gifted group at mid-life. Stanford:
Stanford Univer. Press, 1959.

Thorndike, R. L. Two screening tests of
verbal intelligence. J. appl. Psychol.,
1942, 26, 128-135.

Thorndike, R. L., & Gallup, G. Verbal
intelligence of the American adult. J.
gen. Psychol., 1944, 30, 75-85.


(Received October 12, 1960) 


Journal of Educational Psychology


1961, Vol. 52, No. 3, 161-165 


VISUAL PERCEPTION OF BINARY PATTERNS BY
PRESCHOOL CHILDREN AND BY
SCHOOL CHILDREN¹


DOROTHY W. DYER AND E. RAE HARCUM²


College of William and Mary 


This study attempts to relate the
development of a primacy effect in the
perceptual spans to progress in learning
to read. The primacy effect, as
usually defined, is the phenomenon
that initial elements of a sequential
series show fewer errors in stimulus
reproduction than elements occurring
later in the sequence. In the determination
of perceptual spans, the superior
reproduction of elements at the
left of the visual pattern has often
been attributed to the development of
a set or tendency to perceive
first the elements at the left within
the pattern (Anderson, 1946). Therefore,
the elements on the left would
be favored by a primacy effect. Since
a left-to-right viewing sequence
is required for reading English,
the hypothesis of the present paper is
that it has been developed through
practice in reading English. By the
same argument, more accurate repro-


1 T . . P 
ae Phas is based on a dissertation 
bartment y the senior author to the De- 

liars of Psychology of the College of 

: canes Mary in partial fulfillment of 
Arts aba for the degree of Master of 
support ee 1960). This investigation was 
and ps by PHS Research grants (B-1971 
loioat a from the Institute of Neuro- 
Publi iseases and Blindness, United States 

ic Health Service. 


2 
Our thanks are due to Anne Smith, 


School, Williams- 


y © Virginia, for 
— kind cooperation and assistance in the 
eet of this research. The authors also 
J sh to thank 8..B> Willams and Gertrude 

ohnson for helpful suggestions and critical 
Teading of the manuscript. 




duction for stimuli appearing in the 
right visual field would suggest a
right-to-left viewing sequence and, 
consequently, lesser development in 
reading skill. 

Learning or experience has been 
found to affect performance on tasks 
which require the perception of spatial 
relationships. For example, the tend- 
ency to make perceptual reversals 
decreases as progress is made in learn- 
ing to read. Smith’s (1950) “retarded” 
readers made approximately 50% 
more reversal errors than did a group 
of “advanced” readers. Similarly, inferior
and superior readers exhibited
differences in the relative number of
errors in reproducing tachistoscopic
letter-stimuli on the two sides of fixation
(Crosland, 1939).
superior readers ma 
overall, for the stimuli 
visual field the inferior readers exhibited
greater accuracy. Nevertheless,
the averages for both superior and inferior
readers revealed greater accuracy
in the left field of


sented nonsense Engl 


words to pilingual obser 
d more English let- 


servers recognize! 

ters to the left of fixation, and more 
Hebrew letters to the right of fixation. 
To account for this, Anderson sug- 
a learned predisposition to per- 
he left-hand letters in 
d the right-hand 
s—which are, 


Heron 


errors in recogniz 




in the left visual field when groups of 
letters were presented on both sides 
of fixation. According to Heron, a 
process involving a spatial-temporal 
analysis of the stimulus trace result- 
ing from even a single glance favors 


similar to those 
of letters. Harcum (1957), 


tion by the 
pendencies, 


characteristics jin verbal stimuli 


ng reveal 
roduction 
her visual 
have had 
perior re- 
left visual 


earned 
attend first to stimuli in that ere 




METHOD

Sixty-six children with normal vision
served as observers. Nursery school (N =
13) and kindergarten (N = 18) observers
were enrolled in the same private school.
All first grade (N = 18) and second grade
(N = 17) observers had previously attended
that same private school and were pupils
at a public elementary school. All children
returning a signed parental permission slip
were tested, except for the kindergarten
children, of whom only the first 18 were
tested.

The apparatus was a Dodge-type tachistoscope.
The illumination on the stimulus
field and the fixation field, provided by two
incandescent light sources, produced a field
luminance of about 2 foot-Lamberts. The
observers appeared to experience no difficulty
in detecting the individual stimulus
elements.

Each stimulus template consisted of six
horizontally arranged ellipses. The ellipses
were .360 inches (31.7') in height, .315 inches
(27.8') in width, and were spaced .400 inches
(35.3') apart. Stroke width of the ellipses was
about .035 inches (3.1'). The fixation cross
on the fixation field registered with the
center of the template at a distance of 2 
inches from the observer's eyes. The target 
field was 4.5 inches (9.9°) square. 

Individual patterns were made by blackening
one element of the template on the
left and one element on the right of the
fixation cross. Each of the six element-positions
was filled an equal number of times.
An observer, viewing binocularly, first
reproduced three practice patterns. Each
of the patterns was then observed (in different
sequences) with binocular, left-eye
and right-eye viewing. Each observer was
assigned one of the six possible viewing
orders of the three viewing conditions. An
attempt was made to reduce fixational errors
by requiring the observer to report the
color (which was periodically changed) of
the fixation cross prior to exposure of the
pattern. Immediately after each .15-second
exposure of the pattern the observer pointed
on a blank template before him to the
elements that corresponded with the ones
observed as blackened in the stimulus.
An error was recorded when an observer
indicated that an open element was filled,
or when he failed to indicate a blackened
element.

The intertrial interval was determined by
the pace at which the observer chose



work. Testi = 
e Sgt 5 omg each observer was 
‘4 Sn : : > 

the bsg Chart was used to determine 
cet Based of the observer, and a 
canes, ane ey'was used to check his hand- 
the Ender te ogee Reading readiness of 
wetoreeee en observers was indicated by 
bolitan ‘Bea oe Sections 1-4 of the Metro- 
mere Shira yrs Tests. Teachers’ ratings 
fluency oH “ to estimate the reading pro- 
Servers, Ci ane first and second grade ob- 
Ucvuautiie Cee prevented giving the 
children, test to the nursery school 


RESULTS 


The data from the binocular viewing
condition for the four observer groups
are shown in Figure 1, in which the
means of errors per exposure are
plotted as a function of the element-position
from the left end of the pattern.
The data for the two monocular
viewing conditions are not shown because
they almost exactly duplicate
the binocular data.

As might be expected, there is a
decrease in errors associated with
the higher educational (maturational)
levels.

Since the shapes of the error-curves
for the nursery school and kindergarten
observers appear basically the
same, as do the curves for the first
and second grade observers, the observers
were categorized for purposes of
statistical analysis of the hemifield
differences into two general groups of
preschool vs. school observers. A small
difference in errors favoring the right
visual field appears for the preschool
children. When the frequency of observers
who exhibit superior performance
to the right of fixation is compared
to the expected frequency
under the null hypothesis, the obtained
difference is not significant by
the binomial test (p > .42). However,
the school children exhibit a consistent
difference favoring the stimuli in the
right visual field. This difference,

[Figure 1. Panel title: BINOCULAR. Abscissa: POSITION FROM LEFT (1-6). Ordinate: MEAN ERRORS / OBS. / EXP.]

FIG. 1. Means of errors per element-position
for each observer group under the binocular
viewing condition.


tested as before by the binomial test,
is significant (p < .012). The marked
change in the results of preschool and
school children is especially impressive
when one considers the wide range of
developmental stages represented
within each educational level. For
example, one child in kindergarten was
highly proficient in reading, and his
error-curves were quite similar to
those of the first and second graders.
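
The binomial test used for the hemifield comparison can be sketched as follows. This is a minimal illustration, assuming an exact two-sided test against a chance proportion of .5 in which the smaller tail is doubled; the article does not state whether a one- or two-tailed probability was reported. The function name and the counts in the usage lines are hypothetical.

```python
from math import comb


def binomial_two_sided_p(successes, n):
    """Exact two-sided binomial test against p = .5.

    Doubles the smaller tail probability and caps the result at 1.
    """
    total = 2 ** n
    lower_tail = sum(comb(n, i) for i in range(0, successes + 1)) / total
    upper_tail = sum(comb(n, i) for i in range(successes, n + 1)) / total
    return min(1.0, 2.0 * min(lower_tail, upper_tail))


# Hypothetical counts: 5 of 5 observers more accurate on one side.
print(binomial_two_sided_p(5, 5))  # prints 0.0625
# Hypothetical counts: an even 3-3 split gives no evidence of a difference.
print(binomial_two_sided_p(3, 6))  # prints 1.0
```

Observers with no difference between the two sides would have to be dropped or split before counting; how the article handled such ties is not stated.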

The consistent and regular curves 
in Figure 1 represent the averaging 
out of individual differences in the rel- 
ative numbers of errors left and right 
of fixation. However, these differences 
in performance between the left and 
right fields came closer to canceling 
out for the preschool children than for 


the school children. 


The data of the school children exhibit
a decrease in errors at the most
eccentric element-positions and for
the elements nearest fixation. These
curves are similar to those frequently
found in the data of adult observers on
a similar task, except for the presence
of fewer errors for elements to
the right of fixation.

Table 1 presents the comparison of
the measures of handedness and eyedness
with the total number of errors
made by each observer, and the percentage
of these errors that occurred
for elements to the left of fixation.
Neither the total number of errors
made by an observer, nor the percentage
of his errors which were made for
elements left of fixation, is significantly
related to laterality preferences.


TABLE 1
RELATION OF EYEDNESS AND HANDEDNESS
TO TOTAL ERRORS PER OBSERVER AND
TO PERCENTAGE ERRORS LEFT
OF FIXATION

Handedness | Measure | Left-eyed | Right-eyed
Left       |         | (N = 8)   | (N = 5)
           | Total   | 49.0      | 58.6
           | %L      | 55        | 53
Right      |         | (N = 20)  | (N = 33)
           | Total   | 52.8      | 53.2
           | %L      | 56        | 53


Parison of rat- 
er reading pro- 
Trence of higher 


umber of 
to the left of fixation. The frequencies
in this two-way classification are
based only on the data of the first
and second grade observers. These
two classifications are not significantly
related (χ² = 2.40; p > .05)
although the difference is in the direction
of a positive relation between
relatively more errors left of fixation
and teachers' ratings of above average
reading proficiency.
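
A chi-square test on a two-way classification of this kind can be sketched as below. The sketch assumes a 2 x 2 table (one degree of freedom) and no continuity correction; neither detail is confirmed by the text. The function names and the cell counts in the usage line are hypothetical, since the article's frequencies are not reproduced here.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square for the 2 x 2 table [[a, b], [c, d]], uncorrected."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


def p_value_one_df(chi_square):
    """Upper-tail probability of a chi-square value with one degree of freedom."""
    return math.erfc(math.sqrt(chi_square / 2.0))


# Hypothetical cell counts, for illustration only.
print(round(chi_square_2x2(10, 5, 5, 10), 2))  # prints 3.33

# With one degree of freedom, a chi-square of 2.40 gives p of about .12,
# which agrees with the reported p > .05.
print(p_value_one_df(2.40) > 0.05)  # prints True
```

The function does not guard against a row or column total of zero, which would make the denominator vanish.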


Discussion 


The present study provides evi- 
dence in support of the first hypothesis 
—that the preschool observers do not 
manifest differences in accuracy of 
performance between the left and 
right hemifields. However, the second 
hypothesis, which specifically predicts 
the development of superiority for elements
in the left visual field coincident
with the onset of instruction in reading,
is not confirmed. Fewer errors
were made in reproducing stimuli in
the right visual field, rather than in
the left visual field. Since this significant
difference in the relative number
of errors appeared rather abruptly
with the start of schooling, and since
it was not found in the data of either
of the two preschool groups, it would
seem more reasonably due to instruction
in reading rather than to maturation.

Several mechanisms may be proposed
for relating instruction in reading
to superior reproduction for stimuli
in the right visual field. Children who
have had instruction in reading may
tend to look toward the right end of
the visual pattern (i.e., fixate incorrectly)
in this task. But this does not
seem likely. Similarly, the children
may have acquired a right-to-left
viewing sequence in the early stages
of learning to read, but this also seems
unlikely. On the other hand, the children
may have developed a left-to-right
scanning sequence as predicted,
but, for observers of this developmental
level, recency effects may be
stronger than primacy effects. This
latter mechanism seems to be the most
plausible in view of the small, although
not statistically significant,
positive relationship between ratings
of reading proficiency and greater




numbers of errors to the left of fixation.


SUMMARY


Very simple binary patterns were
observed by nursery school, kindergarten,
first grade, and second grade
school children with left-eye, right-eye,
and binocular viewing. It was
hypothesized, on the basis of data
previously obtained from adult observers,
that the relative accuracy for
reproducing elements to the left of
fixation would increase with school
training in the left-to-right reading
sequence of English.

The preschool observers exhibited
slightly greater accuracy for
stimuli in the right hemifield. Contrary
to the hypothesis, significant
right hemifield superiority was observed
in the school children. Apparently,
perceptual mechanisms of
children at the age at which they start
school do not wholly duplicate adult
mechanisms.

No important effect of viewing-eye
condition, or of eyedness, handedness,
or “conflicting” eyedness-handedness
of the observers was found.


REFERENCES 


Anderson, I. H. The effect of letter-position
on range of apprehension scores, with
special reference to reading disability.
U. Mich. Sch. Educ. Bull., 1946, 18, 37-

Crosland, H. R. Superior elementary-school
readers contrasted with inferior readers
in letter-position, “range of attention,”
scores. J. educ. Res., 1939, 32, 410-427.

Dyer, Dorothy W. Perception of binary
visual patterns by pre-school children
and by school children. Unpublished
master's dissertation, College of William
and Mary, 1960.

Harcum, E. R. Three inferred factors in the
visual recognition of binary targets. In
J. W. Wulfeck & J. H. Taylor (Eds.),
Form discrimination as related to military
problems. (Publ. 561) Pp. 32-37.

Heron, W. Perception as a function of
retinal locus and attention. Amer. J.
Psychol., 1957, 70, 38-48.

Smith, Linda C. A study of laterality
characteristics of retarded readers and
reading achievers. J. exp. Educ., 1950, 18,
321-329.


(Received November 4, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 3, 166-172 


FIXED SEQUENCE VERSUS BRANCHING 
AUTOINSTRUCTIONAL METHODS 
HARRY F. SILBERMAN, RALPH J. MELARAGNO, JOHN E. COULSON, 
and DONALD ESTAVAN 
System Development Corporation 


Two basic autoinstructional meth- 
ods may be identified. The one most 
frequently used displays a fixed se- 
quence of teaching items, requiring all 
subjects to traverse all items (Skin- 
ner, 1958). An alternative method has 
been suggested (Coulson & Silberman, 
1960; Crowder, 1960) which uses a 
variable sequence of items, allowing 
each student to branch to remedial 
materials when needed. 

With a fixed-sequence teaching
machine, adaptation to individual differences
is provided only by the variable
rate at which students move
through the sequence. Variable-sequence
teaching machines provide for
individual differences by using a va- 
riety of criteria, such as error rate and 
error type, for branching to additional 
remedial items and for skipping over 
redundant items. Each student gen- 
erates his own unique sequence of 
items to effect greater learning effi- 
ciency. The basis for the design of an 
autoinstructional method in which the 
sequence of instructional material is 
tailored to the needs of each student 
is the notion that there is no single 
best item sequence nor one best 
method of presenting the materials, It 
is assumed that different students re- 
quire different items with varying 
amounts of review at different times 
during training. 

Coulson and Silberman (1960) at- 
tempted to evaluate the effectiveness 
of branching versus fixed sequence in- 
structional methods. Using a manu- 
ally operated machine they compared 
a fixed item sequence with a variable 


sequence in which some items were
skipped by students who made no errors.
The findings showed that students
trained with the very simple
branching procedure used in that
study learned as well as students
trained without branching, and they
learned in a shorter time. The amount
of instructional material was not controlled,
since subjects in the branching
group had skipped certain items seen
by the fixed-sequence group. In such
a study, obtained differences can be
attributed to the difference in items
presented as well as to the effects of
adjusting the instructional sequence
to the needs of individual students.
In the Coulson and Silberman
(1960) study, the wishes of the student
were not considered in determining
whether he was branched back to
preceding items or skipped ahead to
subsequent material. Nor was error
rate over a sequence of items used as
a branching criterion. The branching
depended entirely on the errors made
on individual items.

The present study was concerned
with two methods of branching. In
Experiment I, a procedure which permitted
the student to branch at his own
option was evaluated, while in
Experiment II the use of errors as a
branching criterion was attempted.
The hypothesis tested in Experiment I
was that optional-branching sequences
are superior to a fixed
sequence; the hypothesis in Experiment II
was that an error-generated
sequence would be superior
to a fixed sequence when identical
items are used in each sequence.


Experiment I 


Two hypotheses relating to optional
branching were tested. The first was
that a fixed-sequence program would
be less effective than the same items
presented as statements in textbook format
through which the student could
branch at his own option. The ease of
review and the facility for skipping
ahead in the textbook format were
the basis for this hypothesis. The second
hypothesis was that performance
on a program which allowed the student
to review previous material, one
item at a time, would also exceed performance
on a fixed-sequence program.


Procedure 


Students from five high schools in Los
Angeles served as paid subjects.
Subjects were obtained from junior and senior
classes, and were pretested with the
Henmon-Nelson Test of Mental Ability
(Form A, Grades 9-12). Random assignment
to three treatment groups resulted in
7 males and 10 females being assigned to
each of two groups and 6 males and 11 females
being assigned to the third group, providing
17 subjects per group.
Sixty-one multiple-choice teaching items
on elementary logic were prepared and
given a preliminary trial in a number of
secondary schools. Items were revised, typed
on cards, and mounted in loose-leaf
binders for the experiment. Topics covered
included simple and compound statements,
connectives and their combinations, and
arguments.
Three instructional methods were used.
The first group, designated “fixed sequence”
(Henmon-Nelson Mean = 114.1) received
the teaching items in a fixed sequence.
Subjects in this group were instructed to
give a covert response to each item in turn,
and to compare their answer with the correct
answer, which was inscribed on the
back of the card. Subjects were also instructed
not to turn back to preceding items.
ing» group, Ba ey Fe 
8,’ (He Nelson Me 1) re 
cei pe «¢ items but was 
ived the same 61 teaching lem. fe 
Permitted to revi preceding items by 


backing up one item-card at a time. Subjects
in this group were encouraged to review
earlier material and were instructed
not to skip any items, so that they would
view each item at least once. The third
group, designated “textbook,” (Henmon-Nelson
Mean = 111.3) received a typewritten
text produced by rewriting the
original items. Each item was converted
from a question into a statement by incorporating
the correct answers, eliminating
distractors and filling in blanks. These statements
were then organized into paragraphs.
Subjects in the third group were instructed
to study the material in whatever manner
they chose. All subjects were observed
through one-way windows to insure that
they were following directions and, additionally,
to check on whether subjects in
the back-branching group exercised the
branching capability.

Mastery of the topics covered in the
teaching session was tested with 24 free-response
and 24 multiple-choice test questions.
Half of the questions were similar to the
actual training materials and half consisted
of situations which required application or
transfer of learning. Split-half reliability of
the posttest was .93 (after Spearman-Brown
correction). The test was administered
immediately after the training session.
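
The Spearman-Brown step mentioned above can be sketched as follows. The correction estimates the reliability of the full-length test from the correlation between its two halves as 2r / (1 + r). The function names are hypothetical, and the half-test correlation is not reported in the article; the second function merely works backward from the corrected figure.

```python
def spearman_brown(r_half):
    """Full-length reliability estimated from the correlation between halves."""
    return 2.0 * r_half / (1.0 + r_half)


def half_correlation_from_full(r_full):
    """Inverse of the correction: the half-test correlation implied by r_full."""
    return r_full / (2.0 - r_full)


# A corrected reliability of .93 implies a half-test correlation of about .87.
print(round(half_correlation_from_full(0.93), 2))  # prints 0.87
```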


Results 

Comparison of the criterion scores 
(Table 1) for the textbook and the 
fixed-sequence groups confirms the hy- 
pothesis that the textbook method, 
which permits the student to branch 
at his own option, is superior. The 
second hypothesis, that the back- 
branching method would be superior 
to the fixed-sequence method, was not 


confirmed. 


TABLE 1
POSTTEST SCORES AND TRAINING TIME

Treatment      | N  | Criterion Posttest | Time (in
               |    | M      | SD        | minutes)
Fixed Sequence | 17 | 28.4   | 8.7       | 32.8
Back Branching | 17 | 30.8   | 9.7       | 31.8
Textbook       | 17 | 33.7   | 6.4       | 31.5




TABLE 2
CORRELATIONS OF APTITUDE, TRAINING
TIME, AND POSTTEST SCORE

                   | Fixed    | Back      | Textbook
                   | Sequence | Branching |
Aptitude vs. Score | .45      | .39       | .67**
Aptitude vs. Time  | -.52*    | -.45      |
Time vs. Score     | -.14     | .03       | -.04

* p < .05.
** p < .01.


For the first hypothesis, that the
textbook method would be superior to
the fixed-sequence method, t = 1.79,
p < .05. For the second hypothesis,
that the back-branching method would
be superior to the fixed-sequence
method, t = .80, p > .10.

Correlations of Henmon-Nelson aptitude
scores, criterion posttest scores,
and training times are given in Table
2.

Experiment II


A branching Program, using error 
rate as a branching decision criterion 
was compared with a fixed-sequence 
program with instructional materials 
equated for the two groups. It was 
hypothesized that performance of sub- 
jects receiving a branching Sequence 
would be superior, with Tespect to 
scores on a criterion Posttest, to that 
of students receiving a fixed sequence, 
This was predicted on the basis of the 
teaching machine’s responsiveness to 
the student’s performance, By provid- 
ing extra remedial items to students 
who have difficulty, the machine tends 
to minimize the number of items seen 
by the brighter student and to minimize
the number of errors on training
items made by the low aptitude student.
The overall effect is intended to
increase the amount of learning in a
given time for each individual.




Method 


Apparatus. A high speed, general purpose
digital computer (Bendix G-15), together
with a 600-slide, random-access slide projector,
and an electric typewriter, served
as a teaching machine. Each slide contained
one multiple-choice instructional item. Subject
used the typewriter to insert his answers
and receive knowledge of results.
The computer was programed to select
a variable sequence of slides from the projector
in response to different errors made by
subject. If subject was exceeding the number
of errors that had been prespecified for
a particular topic, he would be branched to
a set of less difficult items. If he did poorly
on this set he would be shunted to even
easier sets on the same topic before moving
on to the next topic. Some of the multiple-choice
items contained distractors the choice
of which indicated a particular misunderstanding.
Subjects who selected these options
were branched immediately to remedial
slides designed to clear up that misunderstanding.
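
The branching rule described above can be sketched in a few lines. This is an illustration under stated assumptions, not the program that ran on the computer: the `Item` class, the `run_topic` function, the single error threshold per topic, and the callable that stands in for the typewriter are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    prompt: str
    correct: int
    # Maps a diagnostic distractor to the remedial item shown when it is chosen.
    remedial: dict = field(default_factory=dict)


def run_topic(levels, max_errors, answer):
    """Present one topic and return the prompts shown, in order.

    levels: lists of items, ordered from the standard set to the easiest set.
    max_errors: errors tolerated on a set before branching to an easier one.
    answer: callable taking an Item and returning the chosen option.
    """
    shown = []
    for items in levels:
        errors = 0
        for item in items:
            shown.append(item.prompt)
            choice = answer(item)
            if choice == item.correct:
                continue
            errors += 1
            if choice in item.remedial:
                # Diagnostic distractor: show its remedial item immediately.
                follow_up = item.remedial[choice]
                shown.append(follow_up.prompt)
                answer(follow_up)  # response is collected but not scored here
            if errors > max_errors:
                break  # threshold exceeded: branch to the next, easier set
        if errors <= max_errors:
            break  # adequate performance: go on to the next topic
    return shown


standard = [Item("s1", 1, {2: Item("r1", 1)}), Item("s2", 1)]
easier = [Item("e1", 1)]
print(run_topic([standard, easier], 0, lambda item: item.correct))  # prints ['s1', 's2']
```

A subject who answers everything correctly sees only the standard set, while one who exceeds the threshold is moved to the easier set, so the sequence length varies with performance.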
Instructional Materials. Four hundred
and eleven multiple-choice items on logic
(Silberman, Coulson, Melaragno, Saale,
& Englund, 1960) were prepared and given
a preliminary trial in a number of secondary
schools. Items were revised and reproduced on
35-mm. slides for use in the random-access
projector.
Topics covered by these items included
simple and compound statements, logical
connectives, and six argument forms. Instructional


culty levels, 
50 as to con; 
formation, a: 
© items 
tially in the 


telatively easy to answers 
small €p between items was kept 
and the Prompts were increased. 

irty-six students (21 males 
Were selected from four 

s Angeles County. Sub- 

their feat seni Thigh eeated Sina = 

i i e 

Pretested with the Hate ne of 
ental Ability (Form A, Grades 9-12) and 
Withee to two groups. Sub- 
matter (logic) used in hp td Mig! subject 
Procedure. One treatment group was designated
the branching group and the other
the fixed-sequence or nonbranching group.
Members of the branching group received
sequences of items determined by the errors
that were made during the teaching session.
The machine kept cumulative error records
and also a record of errors on individual
items. Throughout the session the machine
selected the sequence of instructional
items based upon
error criteria. In this way, the bright student
was given a minimum of material, while
the slower student was given extra, remedial
material on topics in which errors were committed.
Each member of the fixed-sequence group
was paired at random with one member of
the branching group. The unique sequence
of items which each subject in the branching
group generated was presented to his mate
in the fixed-sequence group. Thus, pairs of
subjects in the fixed-sequence and the
branching groups received identical item sequences.
The difference between the two
groups was that the machine was responsive
to the errors made by subjects in the branching
group but was not responsive to errors
made by subjects in the nonbranching group.
If the brighter member of a pair of subjects
was in the branching group, the resulting
sequence was expected to be too brief to
provide effective instruction for his mate in
the fixed-sequence group. If the brighter
member of a pair of subjects was in the
fixed-sequence group, the sequence which he
received was expected to be longer than
necessary for him, resulting in wasted time
and boredom.
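
The pairing arrangement described above, in which each fixed-sequence subject receives the item sequence generated by a randomly chosen branching-group partner, can be sketched as follows. The function names, subject identifiers, and item labels are hypothetical and serve only to illustrate the design.

```python
import random


def yoke(branching_ids, fixed_ids, seed=0):
    """Pair each fixed-sequence subject at random with one branching subject."""
    if len(branching_ids) != len(fixed_ids):
        raise ValueError("the two groups must be the same size")
    rng = random.Random(seed)  # seeded so the pairing can be reproduced
    shuffled = list(branching_ids)
    rng.shuffle(shuffled)
    return dict(zip(fixed_ids, shuffled))


def sequence_for(fixed_id, pairs, generated_sequences):
    """Return a copy of the item sequence generated by the subject's partner."""
    return list(generated_sequences[pairs[fixed_id]])


# Hypothetical subjects and item sequences.
pairs = yoke(["B1", "B2"], ["F1", "F2"])
generated = {"B1": ["item 1", "item 2"], "B2": ["item 1", "item 1a", "item 2"]}
print(sequence_for("F1", pairs, generated) == generated[pairs["F1"]])  # prints True
```

Because both members of a pair see the same items in the same order, any difference between the groups reflects whether the sequence was responsive to the subject's own errors rather than which items were presented.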
Knowledge of results, which consisted of
a typewriter printout of the number of the
correct answer, was the same for both
groups. One subject was trained at a time.
Subjects were given an explanation of the
operation of the equipment and permitted
to familiarize themselves with the machine
on the first two introductory items. In the
middle of the session, subjects were given a


sho: 
rt break, Immediately after training was 
sttest was ad- 


No time restric- 


ti 2 " 
lons were placed on subjects either during 


Senne period or d' 
riterion Test. Mas' 


r *. 
free-response test items. 
ilar 


jems were quite sim 

é & materials an! ie 
‘pplication or transfer 
Jons not encountered } 




TABLE 3
APTITUDE, TRAINING, AND POSTTRAINING
PERFORMANCE

                                |       | Treatment Groups(a)
                                |       | Branching | Fixed Sequence
Number of Instructional Items   | M     | 156.00    | 156.00
                                | Range | 69-248    | 69-248
Training Errors                 | M     | 24.66     | 21.94
                                | SD    | 17.30     | 13.57
Training Time (minutes)         | M     | 78.83     | 81.61
                                | SD    | 27.34     | 29.21
Henmon-Nelson IQ's              | M     | 123.61    | 122.61
                                | SD    | 19.16     | 17.89
Posttraining Criterion Scores   | M     | 66.33     | 64.33
                                | SD    | 15.93     | 16.42

(a) N = 18 for each treatment group.
The split-half reliability of the test was .97
(after Spearman-Brown correction).


Results 
Table 3 gives data on the number of items traversed by subjects, the number of errors made on instructional items, and the training time. The mean time required for each item was 17 seconds. Table 3 also contains information on the Henmon-Nelson aptitude pretest and the criterion posttest.

Covariance analysis of criterion posttest scores, controlling training time and aptitude, yielded no significant difference between the branching and the fixed-sequence groups.
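
A covariance analysis of this kind can be sketched as a comparison of two least-squares models: one predicting criterion scores from the covariates alone, and one that adds a group indicator. The sketch below is an assumption about the general technique, not the authors' computation; the data, function names, and the pure-Python solver are all hypothetical.

```python
def ols_rss(X, y):
    """Residual sum of squares after a least-squares fit of y on the columns of X."""
    n, p = len(X), len(X[0])
    # Normal equations: (X'X) beta = X'y
    A = [[sum(X[r][i] * X[r][j] for r in range(n)) for j in range(p)] for i in range(p)]
    b = [sum(X[r][i] * y[r] for r in range(n)) for i in range(p)]
    # Gaussian elimination with partial pivoting
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))
        A[c], A[piv] = A[piv], A[c]
        b[c], b[piv] = b[piv], b[c]
        for r in range(c + 1, p):
            f = A[r][c] / A[c][c]
            for k in range(c, p):
                A[r][k] -= f * A[c][k]
            b[r] -= f * b[c]
    # Back substitution
    beta = [0.0] * p
    for c in reversed(range(p)):
        beta[c] = (b[c] - sum(A[c][k] * beta[k] for k in range(c + 1, p))) / A[c][c]
    fitted = [sum(X[r][i] * beta[i] for i in range(p)) for r in range(n)]
    return sum((y[r] - fitted[r]) ** 2 for r in range(n))


def ancova_f(scores, groups, covariates):
    """F ratio (with its df) for a two-group effect after adjusting for the covariates."""
    reduced = [[1.0] + [float(v) for v in row] for row in covariates]
    full = [row + [float(g)] for row, g in zip(reduced, groups)]
    rss_reduced = ols_rss(reduced, scores)
    rss_full = ols_rss(full, scores)
    df_error = len(scores) - len(full[0])
    f_ratio = (rss_reduced - rss_full) / (rss_full / df_error)
    return f_ratio, 1, df_error


# Hypothetical data, invented for illustration: 0 = fixed sequence, 1 = branching
groups = [0, 0, 0, 0, 1, 1, 1, 1]
covariates = [(100, 90), (110, 70), (120, 85), (130, 60),
              (105, 88), (115, 72), (125, 80), (135, 65)]  # (aptitude, minutes)
scores = [50, 56, 61, 68, 62, 69, 73, 79]

if __name__ == "__main__":
    print(ancova_f(scores, groups, covariates))
```

The F ratio is then referred to an F table with 1 and N - 4 degrees of freedom; with 36 subjects and two covariates, as in Experiment II, that would be 1 and 32.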

Intercorrelations of aptitude, criterion score, training time, and instructional errors are given in Table 4.
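
Whether a correlation from a group of 18 reaches the .01 level can be checked with the usual t test for a correlation coefficient. The sketch below assumes a two-tailed test and takes the critical t from a standard table; neither detail is stated in the article, and the function names are illustrative.

```python
from math import sqrt


def t_from_r(r: float, n: int) -> float:
    """t statistic (df = n - 2) for testing whether a correlation differs from zero."""
    return r * sqrt(n - 2) / sqrt(1.0 - r * r)


def critical_r(t_crit: float, n: int) -> float:
    """Smallest absolute correlation that reaches the critical t for n pairs."""
    df = n - 2
    return t_crit / sqrt(t_crit ** 2 + df)


T_CRIT_01_DF16 = 2.921  # two-tailed .01 critical t for 16 df, from a standard t table

if __name__ == "__main__":
    print(round(critical_r(T_CRIT_01_DF16, 18), 3))
    for r in (-0.833, -0.407, -0.179):  # three coefficients printed in Table 4
        print(r, abs(t_from_r(r, 18)) > T_CRIT_01_DF16)
```

Under these assumptions a coefficient must be about .59 or larger in absolute value to be starred, which agrees with the pattern of asterisks for the three values tested here.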


Discussion 


While both experiments were concerned with branching, they are not


TABLE 4
INTERCORRELATIONS OF APTITUDE, CRITERION SCORES, TRAINING TIME, AND INSTRUCTIONAL ERRORS

                         Branching    Fixed
Aptitude vs. Score         .763*      .845*
Aptitude vs. Errors       -.677*     -.769*
Aptitude vs. Time         -.833*     -.407
Time vs. Score            -.682*     -.179
Errors vs. Score          -.904*     -.846*
Errors vs. Time            .193*      .472

* p < .01.


directly comparable in that different
apparatus and procedures were used.

Overall, the results of Experiment I
indicate that the relationship between
flexibility of materials and rate of
learning favors the flexible materials
over the fixed sequence. Certain
implications for further exploration are
evident when this study is compared
with those by Holland (1960) and
Cook and Spitzer (1960), which
contain features similar to it.

With a semester course in introductory
psychology as his subject matter,
Holland also used three instructional 
groups. Members of the first group 
were trained in the conventional man- 
ner, with confirmation of the correct 
answer following subject’s overt re- 
sponse. In his second group, subjects 
received no confirmation following 
their response. 

Holland found no difference in per- 
formance between the confirmation 
program and the same program with- 
out the confirmation. Both of these 
programs were superior to a no-response
control program which only
required subjects to read complete
statements. Holland interpreted his
results as lending support for the
principle that an overt response has to
occur for efficient learning. Insofar as
Holland’s control group can be thought 
of as using a prompting procedure, his 


SILBERMAN, MELARAGNO, COULSON, AND ESTAVAN 


results are in support of confirmation 
as a better teaching method. 

Cook and Spitzer (1960) reported results of a study using a short list of paired-associates which seem to conflict with Holland's findings. A 2 × 2 design was employed in which the experimental conditions were prompting and confirmation, with overt response or no response used in each. Cook and Spitzer concluded that overt responses interfered with learning and that the prompting procedure was superior to confirmation because of the enforced delay between stimulus and response terms in the confirmation method.

The superiority of the textbook group over the fixed-sequence group of Experiment I lends support to the prompting side of the prompting-confirming controversy, when no overt response is required, since the textbook condition corresponds to the prompting procedure while the fixed-sequence condition used a confirmation procedure.

Since the three studies involved materials which were dissimilar in length and quality, it remains for further research to compare the influences of response requirement, S-R delay, and optional selection of material.

Two possible conclusions can be obtained from Experiment II. The first is that a teaching machine which branches on the basis of errors is not significantly superior to a machine which displays a fixed sequence of items. An alternative explanation is that the error criteria used for branching in this experiment, and the remedial items to which a student who was having difficulty [...] inefficient and obscure any [...] the two methods.


FIXED SEQUENCE VERSUS BRANCHING IN AUTOINSTRUCTION


The criterion test results show that low aptitude subjects in both groups did not learn much of the material. While this had been predicted for the fixed-sequence group, it had been expected that members of the branching group, who would presumably receive a sequence specially tailored to their individual needs, would receive the training necessary to insure their learning the material. This was the case for only two of the subjects with low aptitudes in the branching group. Both of these subjects were frequently branched to remedial items, and both scored above their group's mean on the criterion test. The remaining low aptitude subjects in the branching group failed to score at or near their group's mean. All six subjects in the fixed-sequence group with low aptitudes fell well below their group's mean.

Three factors which may account for the ineffectiveness of the remedial items for the low aptitude subjects are: the limited variety of these items, the inability of the subjects to branch out of a remedial series when their difficulties had been resolved, and the inappropriateness of the particular measure of individual differences (errors) used as the criterion for branching decisions.

Since the instructional items in the remedial branches were very similar to those in the main series, except that the subject matter had been divided into small steps, the mode of instruction in Experiment II was of limited versatility. That is, no attempt was made to alter the teaching method for a given subject who was having difficulty. It is postulated that the low aptitude subjects who scored well actually received remedial material which assisted them, while the low aptitude subjects who did poorly were not assisted because the machine simply was not flexible enough to find a set of items adequate for them.

When a low aptitude subject was 
branched to remedial material, no at- 
tempt was made to evaluate the sub- 
ject’s correct responses in order to 
branch him back to the main set of 
items when he demonstrated profi- 
ciency. Instead, subject was obliged to 
remain in the remedial branch until 
he had negotiated all of the items in 
that branch. For many of the low ap- 
titude subjects, this meant that they 
were forced to see more items than
necessary.

In Experiment II it was assumed that the dimensions of individual differences measured by errors on instructional items interacted with the effectiveness of the item sequence presented. The machine was, consequently, programed to treat subjects differentially (i.e., giving different item sequences) with respect to the dimension of error rate. The results show that the particular method of providing for individual differences used in this experiment was not a sufficient condition for effective learning of the logic lesson. Perhaps the common principle of teaching that one should “provide for individual differences” needs to be qualified with the specific conditions for its accomplishment. It may be conjectured that some measures such as response latency or subject's self-evaluation are more appropriate than error rate, and that the computer should have considered these behavior measures for its branching decisions instead of, or in addition to, errors.

SUMMARY

The experiments investigated the effectiveness of two methods of branching, as measured by posttraining criterion tests. In Experiment I, three groups of 17 subjects were used to test
two hypotheses concerning optional 
branching. A fixed-sequence group received
items in fixed order; a back-branching
group, receiving the same
items as the first group, was permitted
to back up one item at a time to 
review earlier items; a third group re- 
ceived the same items cast in state- 
ment form and organized into para- 
graphs permitting subjects to choose 
material at their own option. A sig- 
nificant difference on a posttest in 
favor of the third group was obtained 
when the first and third groups were 
compared. 

In Experiment II, a computer-controlled teaching machine was used to evaluate the effectiveness [...] items which [...] branching group received was presented to his mate in the fixed-sequence group. Covariance analysis of criterion scores using aptitude and training time as control variables yielded no significant difference between the branching and fixed-sequence conditions. A discussion of the limitations of the study and of possibilities for further research on the problem was presented.


REFERENCES

Cook, J. O., & Spitzer, M. E. Supplementary report: Prompting versus confirmation in paired-associate learning. J. exp. Psychol., 1960, 59, 275-276.

Coulson, J. E., & Silberman, H. F. Effects of three variables in a teaching machine. J. educ. Psychol., 1960, 51, 135-143.

Crowder, N. A. Automatic tutoring by intrinsic programming. In A. A. Lumsdaine & R. Glaser (Eds.), Teaching machines and programmed learning. Washington: National Education Association, 1960.

Holland, J. G. Program design [...]. Paper read at American Psychological Association, Chicago, September 1960.

Silberman, H. F., Coulson, J. E., Melaragno, R. J., Estavan, D. P., & [...]. Teaching machine program: Logic. (TM-534) Santa Monica, Calif.: System Development Corporation, [...].

Skinner, B. F. Teaching machines. Science, 1958, 128, 969-977.


(Received November 7, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 3, 173-178


A RE-EXAMINATION OF COLLEGE PRODUCTIVITY¹

ALEXANDER W. ASTIN²
National Merit Scholarship Corporation

Several recent investigations have attempted to evaluate what part the student's undergraduate educational experience plays in stimulating him to go on to the PhD degree. The first of these, the studies of Knapp and Goodrich (1952), and Knapp and Greenbaum (1953), defined the productivity of an undergraduate college as the percentage of its graduates who eventually obtain the PhD. In a subsequent study Holland (1957) showed that an institution's productivity is related in part to the academic ability of the students who enroll; that is, the highly productive institution has a greater proportion of talented students among its student body than does the less productive institution. Following this lead, Thistlethwaite (1959a, 1959b) controlled for differences with respect to student ability by adjusting each college's PhD productivity rates in terms of its “talent supply” (that is, the estimated academic ability of the student body). He reported correlations as high as .83 between these adjusted productivity rates and certain “environmental press” at a sample of 36 colleges. Productivity of PhD's in the natural sciences (NS) was correlated with a somewhat different set of press than was productivity in the arts, humanities, and social sciences (AHSS).

The present study attempts to extend this work by exploring the following questions:

¹ This study is a part of the research program at the National Merit Scholarship Corporation and was supported by a grant from the National Science Foundation.
² The author is indebted to John L. Holland, Laura Kent, and Donald L. Thistlethwaite for reviewing the manuscript.



1. Is ability the only quality of the 
student body which is related to a col- 
lege’s productivity? Are there not 
other systematic differences among the 
student bodies enrolling at various in- 
stitutions which are related to PhD 
productivity? One such difference 
might be in the proportion of entering 
students who plan careers in either NS 
or AHSS. Certainly any college which 
is attended predominantly by students 
who plan careers in the natural 
sciences can be expected to have a 
relatively high proportion of gradu- 
ates who go on for the PhD in NS, in- 
dependent of any special effects of the 
college environment. Conversely, this 
same college would probably have a 
low productivity rate of AHSS PhD’s. 
Two other characteristics of entering 
students which might affect the pro- 
ductivity of a given school are the 
ratio of men to women, and the pro- 
portion of students who already intend 
to get the PhD degree. 

2. If the educational goals, major 
fields, and sex ratios of entering stu- 
dents are related to productivity, are 
they also responsible in some measure 
for the correlations found earlier be- 
tween “college press” and productiv- 
ity? These correlations have been in- 
terpreted as evidence for a causative 
role of the college on the student’s 
motivation; e.g., “the results ... indicate
that the teacher plays an important
role in motivating talented
undergraduates to seek advanced degrees”
(Thistlethwaite, 1959a, p. 75);
“The analysis confirms once again the
view that one type of college press
stimulates achievement in the natural
sciences, while a different type
facilitates achievement in the arts,
humanities, and social sciences”
(Thistlethwaite, 1959b, p. 188). How- 
ever, since the information on college 
press came from students at each in- 
stitution, it may reflect differences 
among student bodies which existed 
prior to the students’ entrance into 
college, differences which are not a 
product of any unique qualities of the 
college environment. Before we can 
accurately assess the motivating func- 
tion of the college itself, we must find 
out the extent to which these “pre- 
college” characteristics of the student 
body are related to the college's pro- 
ductivity. 

Specifically, this study examined: 

1. The extent to which productivity 
rates of NS and AHSS PhD's (adjusted
for talent supply) are a function
of: (a) the proportion of students
who, prior to entering college, intend 
to major in natural science; (b) the 
proportion of students who, prior to 
entering college, aspire to the PhD de- 
gree; and (c) the ratio of men to 
women among students enrolling in 
the college. 

2. The extent to which relationships previously obtained between adjusted PhD productivity and college press are modified, when the effects of these additional precollege variables are removed.


METHOD


College Input Measures 


Information on three precollege or input
variables (the student's sex, probable college
major, and highest degree aspired to)
was drawn from a questionnaire which had
been mailed to a 10% random sample of all
high school students who scored above the
64th percentile on the 1959 National Merit
Scholarship Qualifying Test. Of the students
who returned the questionnaires, 2,884
attended one of the 36 institutions about
which college press data had been obtained
in the earlier studies. The number of students
attending each institution ranged from
15 to 208, with a median of 72.


ALEXANDER W. ASTIN 


The following three input measures were
obtained for each of the 36 colleges: the
percentage of students who originally
planned to major in natural science (% NS
Majors), the percentage of students aspiring
to the PhD degree (% PhD Aspirants), and
the percentage of males among the entering
students (% Males).³


RESULTS

Table 1 shows the correlations between the three input measures and the adjusted productivity indexes of the 36 colleges.⁴ It is evident that a significant amount of the variation in adjusted PhD productivity can still be accounted for on the basis of differences among the entering students. To obtain a productivity index which is independent of the quality of the student body, then, requires the use of corrections for variables other than just aptitude level.
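
The adjusted productivity index discussed here is defined in the note to Table 1 as the discrepancy between a college's actual rate and the rate expected from its talent supply. One way to picture that adjustment is as a regression residual. The sketch below assumes a simple linear prediction and takes the discrepancy as actual minus expected; both choices, the function names, and the data are hypothetical and are not taken from Thistlethwaite's procedure.

```python
def fit_line(x, y):
    """Least-squares slope and intercept for predicting y from x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def adjusted_rates(talent_supply, actual_rate):
    """Actual minus expected rate, the expected rate being predicted from talent supply."""
    slope, intercept = fit_line(talent_supply, actual_rate)
    return [y - (intercept + slope * x) for x, y in zip(talent_supply, actual_rate)]


# Hypothetical colleges; the values are invented for illustration only
talent_supply = [0.10, 0.20, 0.30, 0.40, 0.50]
actual_rate = [0.02, 0.05, 0.04, 0.09, 0.10]

if __name__ == "__main__":
    print([round(r, 3) for r in adjusted_rates(talent_supply, actual_rate)])
```

Residuals built this way are uncorrelated with talent supply by construction, but they can still correlate with any other characteristic of the entering class that was left out of the prediction, which is the point the paragraph above makes.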


The intercorrelations of the three
input variables are given in Table 2.

To estimate the unique contributions
of these variables to productivity,
their partial correlations with productivity
rates were computed (Table 3).
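
A second-order partial correlation can be built up from zero-order correlations by applying the first-order formula recursively. The sketch below uses the AHSS coefficients as read from Tables 1 and 2; reading the three intercorrelations of Table 2 as .46, .50, and .49 is an assumption, since the printed table is damaged. The function name and matrix layout are illustrative.

```python
from math import sqrt


def partial_corr(R, i, j, controls):
    """Correlation of variables i and j with the variables in `controls` held constant."""
    if not controls:
        return R[i][j]
    k, rest = controls[0], controls[1:]
    r_ij = partial_corr(R, i, j, rest)
    r_ik = partial_corr(R, i, k, rest)
    r_jk = partial_corr(R, j, k, rest)
    return (r_ij - r_ik * r_jk) / sqrt((1 - r_ik ** 2) * (1 - r_jk ** 2))


# Order: 0 = adjusted AHSS productivity, 1 = % NS Majors,
#        2 = % PhD Aspirants, 3 = % Males
R = [
    [1.00, -0.69, 0.12, -0.20],
    [-0.69, 1.00, 0.46, 0.50],
    [0.12, 0.46, 1.00, 0.49],
    [-0.20, 0.50, 0.49, 1.00],
]

if __name__ == "__main__":
    print(round(partial_corr(R, 0, 1, [2, 3]), 2))  # % NS Majors
    print(round(partial_corr(R, 0, 2, [1, 3]), 2))  # % PhD Aspirants
    print(round(partial_corr(R, 0, 3, [1, 2]), 2))  # % Males
```

With these inputs the three partials should come out close to the AHSS row of Table 3 (-.83, .66, and .00), which lends some support to the assumed reading of Table 2.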


³ In order to make these distributions more nearly normal, all percentages were converted by means of the arcsin transformation (Guilford, 1954, pp. 574-576). Since input measures were based on limited samples of the 1959 freshman classes at the 36 colleges, two checks were made on the representativeness of the NMSQT samples: First, % Males was correlated with the percentage of males among the entire freshman class at each of these 36 institutions in Fall 1958, as reported by United States Office of Education [...]; the correlation was .97. Second, mean Scholastic Aptitude Test scores [...] were available on the [...] of the 36 [...]
⁴ [...] similar assumptions were made [...] with respect to talent supply and “college press.”
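
The footnote cites Guilford (1954) for the arcsin transformation but does not print the formula. A minimal sketch follows, assuming the common angular form φ = 2 arcsin √p with the result in radians; the function name is illustrative and the exact variant used in the study is not confirmed by the text.

```python
from math import asin, sqrt, pi


def arcsine_transform(pct: float) -> float:
    """Angular transformation of a percentage (0-100), returned in radians."""
    p = pct / 100.0
    return 2.0 * asin(sqrt(p))


if __name__ == "__main__":
    for pct in (5.0, 25.0, 50.0, 75.0, 95.0):
        print(pct, round(arcsine_transform(pct), 3))
```

The transformation stretches the ends of the percentage scale relative to the middle, so that colleges with very high or very low percentages are spread out more evenly before correlations are computed.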


RE-EXAMINATION OF COLLEGE PRODUCTIVITY 


Table 3 makes it clear that % Males contributes to neither NS nor AHSS productivity beyond the combined contributions of % NS Majors and % PhD Aspirants. (This would suggest that differences in the proportions of men and women who get the PhD can be accounted for by differences in their precollege levels of aspiration and intended fields of study.) These findings prompted the decision to omit % Males as an input variable, and to deal only with % NS Majors and % PhD Aspirants.

Table 4 shows the correlations of % NS Majors and % PhD Aspirants with the 13 Faculty and Student Press Scales which were previously reported as significantly related to adjusted


TABLE 1
CORRELATIONS BETWEEN THREE INPUT MEASURES AND THE ADJUSTED(a) PhD PRODUCTIVITY RATES OF 36 INSTITUTIONS

                                                  Input Variables
                                         % NS Majors  % PhD Aspirants  % Males
Adjusted Productivity in
  Natural Science (NS)                        ?           .68**         .45**
Adjusted Productivity in Arts,
  Humanities, and Social Science (AHSS)     -.69**        .12          -.20

(a) Adjusted productivity is defined as the discrepancy between a college's expected rate of producing PhD's (as predicted from its talent supply) and its actual productivity rate (Thistlethwaite, 1959b).
** p < .01.

TABLE 2
INTERCORRELATIONS OF THREE INPUT VARIABLES

                      % PhD Aspirants   % Males
% NS Majors                .46**         .50**
% PhD Aspirants                          .49**

** p < .01.


TABLE 3
SECOND-ORDER PARTIAL CORRELATIONS SHOWING THE INDEPENDENT CONTRIBUTION OF EACH INPUT VARIABLE TO ADJUSTED PhD PRODUCTIVITY

                                         Input Variables
                                % NS Majors  % PhD Aspirants  % Males
Adjusted NS Productivity            .41*         .54**          .03
Adjusted AHSS Productivity         -.83**        .66**          .00

* p < .05.
** p < .01.

TABLE 4
CORRELATIONS OF TWO INPUT VARIABLES WITH 13 COLLEGE PRESS SCALES AT 36 INSTITUTIONS

                                                      Input Variable
                                                % NS Majors  % PhD Aspirants
Student Press
  Humanism                                         -.39*         .36*
  Breadth of interests                             -.47**        .08
  Reflectiveness                                   -.29          .45**
  Participation                                     .12         -.41*
  Aggression                                        ?**          .18
  Scientism                                         .4?**        .64**
  Social conformity                                -.36*        -.52**
Faculty Press
  Excellence of social science faculty
    and resources                                  -.71**       -.04
  Flexibility of curriculum                        -.56**        .03
  Energy and controversiality of instruction       -.27          .31
  Informality and warmth of student-faculty
    contacts                                       -.04          .5?**
  Closeness of supervision                          .03         -.45**
  Directiveness of teaching methods                -.41*        -.34*

* p < .05.
** p < .01.

productivity rates (Thistlethwaite, 1959b, pp. 187-188). Nine of the 13 college press scales are significantly (p < .01) related to one or both of the


TABLE 5
CONTRIBUTION OF TWO INPUT VARIABLES TO CORRELATIONS BETWEEN COLLEGE PRESS AND ADJUSTED PhD PRODUCTIVITY

                                            Adjusted NS Productivity    Adjusted AHSS Productivity
                                            Original(a)   Partial(b)    Original(a)   Partial(b)
Student Press
  Humanism                                     -.07         -.25           ?**          .39*
  Breadth of interests                         -.27         -.26           .67**        .42*
  Reflectiveness                               -.05         -.38*          .67**        .40*
  Participation                                 .00          .21          -.53**       -.42*
  Aggression                                    .43**        .29          -.5?**       -.13
  Scientism                                     .59**        .18           .03          .33
  Social conformity                            -.42*        -.10           .03         -.03
Faculty Press
  Excellence of social science faculty
    and resources                              -.49**       -.33           .83**        .55**
  Flexibility of curriculum                     ?            ?             .6?**        .33
  Energy and controversiality of instruction   -.13         -.35*          ?            .35*
  Informality and warmth of student-faculty
    contacts                                    ?            ?             ?            ?
  Closeness of supervision                      ?            ?             ?            ?
  Directiveness of teaching methods             ?            ?             ?            ?

(a) From Thistlethwaite (1959b, pp. 187-188).
(b) Independent of % NS Majors and % PhD Aspirants.
(c) [...] correlations were inadvertently interchanged in the original report.
* p < .05.
** p < .01.


input variables, and three of the other four press scales have borderline (p < .05) relationships with input.

In order to remove the effects of these two input variables from the correlations previously obtained between college presses and PhD productivity, second-order partial correlations were computed. Table 5 shows these correlations, along with the original correlations given in the earlier report.⁵ It can be seen that all the larger correlations tended to shrink when the two input variables were partialed out. Of the 11 previously significant (p < .01) correlations, only one (between AHSS Productivity and “Excellence of social science faculty and resources”) remained significant; five of these, all involving AHSS Productivity, were reduced to borderline significance (p > .01, < .05), and five to nonsignificance. In addition, all five previously borderline correlations were reduced to [...]

⁵ Second-order partial correlations in a relatively small sample such as this should be regarded as very rough approximations. They are presented here only to show trends in the effects of student input on the original correlations.

[...] First, adjusting the [...] undergraduate [...] in terms of differences in the aptitude levels of their students does not result in productivity rates which are independent of student quality. At least two other characteristics of entering student bodies (the percentage planning to major in NS and the percentage aspiring to the PhD degree) are significantly related to adjusted productivity rates. Secondly, these input variables account for a large share of the relationships previously found between college press and adjusted productivity. In short, much of the variation in PhD productivity which had been attributed to the motivating effects of the colleges now appears to be a function of differences among the student bodies which enroll.

An additional question posed by these analyses concerned the justification for using two productivity indexes. The correlations of % NS Majors with both productivity rates (Table 1) suggested that breaking down productivity in this way introduces extraneous variability due to differences in the students' fields of interest. To check this notion more directly, the difference between each school's productivity rates was obtained (that is, NS rate - AHSS rate). The correlation between this difference score and % NS Majors was .84. In view of this finding, the total (adjusted) PhD productivity rate (see Thistlethwaite, 1959b, p. 184) was obtained for each college. The correlation between total productivity and % PhD Aspirants was .81. (% Males and % NS Majors were not significantly related to total productivity.) It seems apparent, then, that about two-thirds of the variation among these 36 colleges in total PhD productivity can be accounted for on the basis of a single input variable.

These findings have several implications. For one thing, they highlight
one of the difficulties in attempting to
assess the influence of the college on 
its students: differential “effects” of 
colleges may well turn out to be 
spurious, unless the relevant student 
input variables are controlled. For 
another, these results suggest that it 
may be necessary to reinterpret some 
of the results of other studies, not only 
more recent ones of college “press” 
(Thistlethwaite, 1960), but also those 
which utilize adjusted college productivity
rates (e.g., Holland, 1960).
From the available evidence, it is 
not clear whether or to what extent the 
undergraduate institution stimulates 
or inhibits PhD productivity. The high 
correlations found between input and 
productivity indicate that the college 
plays a relatively smaller role in this 
sample than was previously supposed;
however, we would have to study 
larger samples of students and colleges 
before we could determine precisely 
the relative contributions of input 
variables and college environments. 
Moreover, since this sample of colleges 
is far from representative,⁶ due caution
should be used in attempting to gen- 
eralize the results to other institutions. 


SUMMARY

The PhD-producing rates of 36 colleges were subjected to further study. It was shown that a college's productivity rates are related to characteristics of its entering students other than just their academic ability: the percentage planning to major in natural science and the percentage aspiring to the PhD degree. When the effects of these two input variables were partialed out, correlations previously obtained between college press and productivity were reduced considerably.

⁶ These 36 colleges were selected initially because they were attended by relatively large numbers of National Merit Scholars.


REFERENCES

Guilford, J. P. Psychometric methods. New York: McGraw-Hill, 1954.

Holland, J. L. Undergraduate origins of American scientists. Science, 1957, 126, 433-437.

Holland, J. L. The prediction of college grades from personality and aptitude variables. J. educ. Psychol., 1960, 51, 245-254.

Knapp, R. H., & Goodrich, H. B. Origins of American scientists. Chicago: Univer. Chicago Press, 1952.

Knapp, R. H., & Greenbaum, J. J. The younger American scholar: His collegiate origins. Chicago: Univer. Chicago Press, 1953.

Thistlethwaite, D. L. College environments and the development of talent. Science, 1959, 130, 71-76. (a)

Thistlethwaite, D. L. College press and student achievement. J. educ. Psychol., 1959, 50, 183-191. (b)

Thistlethwaite, D. L. College press and changes in study plans of talented students. J. educ. Psychol., 1960, 51, 222-234.

United States Office of Education. Opening (fall) enrollment in higher education, 1958: Institutional data. (Circular No. 544) Washington, D. C.: United States Government Printing Office, [...].


(Received December 16, 1960) 


JOURNAL OF 
EDUCATIONAL PSYCHOLOGY 


Volume 52 


August 1961 


Number 4 


AN ANALYSIS OF THE FACTORIAL DIMENSIONS OF 
THE ACHIEVEMENT MOTIVATION CONSTRUCT


JAMES V. MITCHELL, Jr. 


University of Texas 


The construct of achievement motivation has been the object of considerable research and discussion in recent years (Atkinson, 1958; McClelland, Atkinson, Clark, & Lowell, 1953), and it is only too apparent that there are wide differences of opinion about the adequacy of current efforts to measure that intriguing variable ([...], 1960; Blake & Mouton, 1959). Much of that effort, of course, has been focused on the assessment of n Achievement by means of analyses of TAT protocols, a technique developed by McClelland et al. (1953) and regarded as sui generis by its adherents. One adherent of this technique has even relegated all other means of assessing achievement motivation to the limbo of “ad hoc” devices, and he admonishes all users of such devices not to assume that the theoretical conceptions and factual information resulting from projective analyses of n Achievement are equally pertinent to studies involving the “ad hoc” assessment devices (Atkinson, 1960, pp. 275-276). Others not quite so identified with the McClelland approach seem equally insistent that n Achievement scores from different instruments exhibit significant relationships to one another and to the common behavioral data they are supposed to measure. With discouraging regularity, however, investigators have reported the absence of relationship
between different measures of achieve- 
ment motivation (Bendig, 1957; Hi- 
melstein, Eschenbach, & Carp, 1958; 
Melikian, 1958), and attempts to pre- 
dict actual behavioral criteria have 
often resulted in essentially negative 
findings, confusion, or vast conceptual 
reinterpretation (Birney, 1958; Lazarus,
Baker, Broverman, & Mayer,
1957; Vogel, Baker, & Lazarus, 1958). 
It is readily apparent that there is a 
rather pronounced lack of consensus 
about both the conceptual and meth- 
odological aspects of the achievement 
motivation construct, and there is an 
urgent need for additional research 
that will help to provide structure for 
its currently amorphous state of ex- 
istence. 

The present investigation came 
about as a result of a larger study of 
the cultural and situational determi- 
nants of achievement motivation for 
a group of college students. For this 
larger study the measurement of 
achievement motivation was obviously 
a critical operation, and since the re- 
search literature provided ample evi- 
dence of the difficulties involved in this 
kind of assessment, it seemed impera- 
tive to select as many different meas- 
ures of achievement motivation as 
would be necessary to assure a thor- 
ough representation of that domain 
and then to make a careful analysis 
of their correlates and factorial dimensions.
It was early recognized that such
a procedure should result in findings
that would have general theoretical 
significance and applicability, because 
they would help to shed light on 
whether achievement motivation was 
a unitary construct with invariable 
meaning, as many seem to regard it, 
or whether it referred in fact to sev- 
eral psychological dimensions, each 
with different implications and differ- 
ent empirical correlates. It is this task 


that defines the purpose of the present 
investigation. 


PROCEDURE

The following instruments were employed
in the analysis to represent the domain of
possible achievement related attitudes.

1. McClelland Test of Achievement Mo- 
tivation. This instrument was administered 
and scored in accordance with the instructions
given by McClelland in his book The
Achievement Motive (McClelland et al.,
1953). In scoring a representative sample of
protocols provided in that book, the present
investigator established that his scores
correlated .92 with those reported by the test
author.


2. A sentence completion test of achievement motivation. This test was developed by the investigator as a simplified projective measure of achievement motivation, with the thought that such a measure might well have more validity than an instrument with the scoring complexity of the McClelland test. The instrument consisted of 46 items, some developed by the author and some taken from experimental tests developed by other investigators. The items selected represented various degrees of achievement orientation, from strongly achievement oriented to complete lack of achievement orientation. The instrument was scored by applying to each item the very first criterion of the McClelland system for scoring the TAT protocols, viz., the presence or absence of an “achievement goal.” According to McClelland, the existence of an achievement goal is indicated by any statement which implies “success in competition with some standard of excellence” (McClelland et al., 1953, p. 110). If such were expressed or implied in the completion of any item, that item was scored +1. If the item completion involved no achievement imagery of this type, it was scored 0. If the item completion expressed or implied an evident lack of success or [...] competing with some standard of excellence, it was scored -1. The score on the test was simply the algebraic sum of the item scores.

3. An adjective checklist co chieve- 
achievement related items. A ea to those 
ment related adjectives were adde' esent in 
similarly oriented and already i Values 
the Bilis’ Index of Adiustssent a and for 
(Bills, Vance, & McLean, 1951); 4 rated 
each of these adjectives the ye auhiiee® 
himself on a 5-point scale. The his in- 
achievement motivation score OD ical rat- 
strument was the total of the ees aie Cor- 
ings for the contributing adjectiv ore was 
rected split-half reliability for Se une to 
84. Remaining items not contribu urpose 
this score served to mask the real P’ ad- 
for which the instrument was Dene also 
ministered. Noncontributing er fol- 
served this same function for the tw 
lowing instruments, « rement 
4. An inventory-type test of achievement motivation.
This inventory contained items of the True-False
variety, many of which were borrowed from the MMPI.
Other items related to achievement motivation were
added by the investigator. The corrected split-half
reliability of this instrument was .75.

5. A multiple-choice questionnaire including
achievement motivation items. The alternatives for
each item in the questionnaire were serially ordered
so as to [illegible] choices along a continuum
representing greater or lesser degrees of the variable
in question. The [illegible] chosen thus determined
[illegible] continuum and the individual's score for
that particular item.


[illegible] reliability of [illegible]

6. [illegible] This scale was included in the battery
to clarify whether [illegible] the imputed drive
properties of Manifest [illegible] the disciples of
[illegible] verification from [illegible] achievement
[illegible]

7. Level of [illegible] the first and [illegible]
semester all subjects [illegible] levels of aspiration
by answering [illegible] “What grade can you reasonably
expect to attain on this examination?” [illegible]
estimates were originally received in letter grade
form and [illegible] 10-point numerical scale
[illegible] distinctions, e.g. [illegible] The actual
grade received on the examination was also converted
to the same 10-point numerical system. For each
subject the numerical value of the actual grade
received was subtracted from the numerical value of
his level of aspiration estimate. [illegible] for both
the first and last [illegible], resulting in two
indices which [illegible] to which the subject
actually attained or approached his level of
aspiration.

8. [illegible] of discrepancy between [illegible] and
potentiality. The numerical [illegible] averages for
all subjects for the semester in which the study was
conducted were [illegible] normalized, and converted
into standard score form. The ACE scores for all
subjects were similarly normalized, and converted into
standard score form. A discrepancy score was computed
for each subject by subtracting his standard score for
the ACE from his standard score for his semester grade
point average (GPA − ACE).
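The two discrepancy indices described in items 7 and 8 can be sketched as follows. This is only an illustration under stated assumptions, not the author's computation: the function names, the sample values, and the constant used to reverse the aspiration discrepancy are hypothetical, and the normalization step that preceded standardization in the study is omitted, so plain z scores stand in for the normalized standard scores.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Convert raw values to standard (z) scores."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def gpa_minus_ace(gpa, ace):
    """Achievement-potentiality discrepancy: z(GPA) minus z(ACE)."""
    return [g - a for g, a in zip(z_scores(gpa), z_scores(ace))]


def aspiration_discrepancy(aspired, actual, constant=10):
    """Aspired minus actual grade on the 10-point scale, reversed by
    subtracting from a constant so that higher scores mean smaller gaps.
    The value of the constant is an assumption made for this sketch."""
    return [constant - (asp - act) for asp, act in zip(aspired, actual)]


# Hypothetical data for four subjects.
gpa = [2.0, 2.5, 3.0, 3.5]
ace = [90, 120, 100, 110]
print(gpa_minus_ace(gpa, ace))
print(aspiration_discrepancy([8, 7, 9, 6], [6, 7, 5, 6]))
```

A positive GPA − ACE value marks a subject whose grades stand higher in the group than her aptitude score does, which is the sense in which the index serves as the criterion of achievement motivation.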


All of the afore-mentioned data were secured from
[illegible] female college students who were in a
teacher training curriculum. The analysis was divided
into two parts: (a) [illegible] from the various
instruments were correlated with one another and with
grade point average criteria, and (b) as a result of
the insights gained from the first analysis,
[illegible] of the individual items comprising the
various instruments, some 29 variables in all, were
then intercorrelated, subjected to a centroid factor
analysis as described by Thurstone (1947), and
subsequently rotated to orthogonal simple structure by
means of Kaiser's varimax technique (1958).
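The first part of the analysis, intercorrelating the instrument scores, can be illustrated with a short sketch. The instrument names and scores below are invented for the example, and the helper functions are hypothetical; the centroid extraction and varimax rotation of the second part are not shown.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def intercorrelations(measures):
    """Upper triangle of the correlation matrix, keyed by pairs of names."""
    names = list(measures)
    return {
        (a, b): pearson_r(measures[a], measures[b])
        for i, a in enumerate(names)
        for b in names[i + 1:]
    }


# Hypothetical scores for five subjects on three measures.
scores = {
    "adjective_checklist": [12, 15, 9, 20, 17],
    "inventory": [10, 14, 8, 18, 13],
    "gpa_minus_ace": [0.1, 0.5, -0.9, 1.2, 0.3],
}
for pair, r in intercorrelations(scores).items():
    print(pair, round(r, 2))
```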


RESULTS AND DISCUSSION


Table 1 shows the intercorrelations between measures
of achievement motivation, ACE, and GPA. The GPA−ACE
discrepancy score (last column) functions as the
criterion variable for achievement motivation.
Immediately apparent are the differences in
correlations between this criterion and the various
measures of achievement motivation. The simple
adjective checklist was the best predictor of the
criterion. The achievement motivation inventory was
significantly related to the criterion, but, unlike the
adjective checklist, had a significant
negative correlation with ACE as well. 
The level of aspiration measures were 
related significantly to the criterion, 
but this may be partly artifactual in 
nature, resulting from the presence of 
a common component, grades, in the 
make-up of predictor and criterion. 
Manifest anxiety was quite unrelated 
to the criterion. Neither of the projec- 
tive instruments (sentence completion 
and TAT n Achievement) was related 
to the criterion. Interestingly enough, 
however, the sentence completion test 
did have a significant and moderately 
high negative correlation with ACE 
and a lower but still significant nega- 
tive correlation with GPA. These cor- 
relations, along with the negative cor- 
relation reported earlier for the 
achievement motivation inventory and 
ACE, re-emphasize the necessity for 
cautious empirical inquiry into the 
real dimensions and correlates of com- 
plex measures of complex constructs 
like achievement motivation. 

The intercorrelations between the 
various measures of achievement mo- 
tivation included in Table 1 show some 
interesting and often perplexing 
trends. The adjective checklist is sig- 
nificantly correlated with the achieve- 
ment motivation inventory but not 
with the sentence completion test. The 
sentence completion test is, however, 
significantly correlated with the 
achievement motivation inventory. 
The pattern indicates some overlap
between instruments that presumably
measure the same variable, but hardly
inspires confidence in the unidimen-
sionality of the achievement motiva-
tion construct. The level of aspiration
measures, besides being related to the 
criterion, are also related significantly 
to one or two other measures of 
achievement motivation. The TAT 
and manifest anxiety variables show 
little relationship to other measures. 


TABLE 1
INTERCORRELATIONS BETWEEN MEASURES OF ACHIEVEMENT MOTIVATION, ACE, AND GPA

Measures | Achievement Motivation Inventory | Sentence Completion | TAT n Achievement | Aspiration-Grade^a (first test) | Aspiration-Grade^a (last test) | Manifest Anxiety | ACE | GPA | GPA−ACE
Adjective Checklist | .40** | .13 | [illegible] | .15 | .02 | −.15 | −.02 | .36 | [illegible]
Achievement Motivation Inventory | | .35** | −.07 | .07 | −.20* | [illegible] | −.23** | −.06 | [illegible]
Sentence Completion | | | −.04 | −.18* | [illegible] | .09 | −.32** | −.17* | .09
TAT n Achievement | | | | .13 | .18* | [illegible] | [illegible] | [illegible] | −.07
Aspiration-Grade^a (first test) | | | | | .26** | [illegible] | [illegible] | .49** | [illegible]
Aspiration-Grade^a (last test) | | | | | | .05 | .19* | [illegible] | .27
Manifest Anxiety | | | | | | | [illegible] | −.11 | .09
ACE | | | | | | | | .44 | [illegible]

^a Direction of scoring reversed by subtracting each discrepancy from a constant; the higher scores represented the smaller discrepancies.
* Significant at .05 level.
** Significant at .01 level.


The whole pattern of interrelationships was one that
suggested not only the multidimensionality of the
putative achievement motivation construct, but also
the probability that various measures of that
construct might reflect quite different aspects of it
and would therefore either be little correlated or
selectively correlated with other measures and with
the criterion. A factor analysis of all of the items
of these various measures seemed admirably suited to
shed light on such questions as these, and such an
analysis was undertaken as the second step in the
investigation.

*The correlation matrix for the 29 individual
achievement motivation items that were subjected to
factor analysis has been deposited with the American
Documentation Institute. Order Document No. 6739 from
ADI Auxiliary Publications Project, Photoduplication
Service, Library of Congress; Washington 25, D. C.,
remitting in advance $1.25 for microfilm or $1.25 for
photocopies. Make checks payable to: Chief,
Photoduplication Service, Library of Congress.


Table 2 shows the rotated factor matrix which resulted
when the individual items of the achievement
motivation measures were intercorrelated and factor
analyzed. The items of the multiple-choice inventory,
not included in the earlier analysis, were included in
the present one to broaden the basis for analysis and
comparison. Factorization was terminated when the
factors isolated accounted for approximately 100% of
the estimated common factor variance. A factor was not
interpreted, however, unless the product of its two
highest loadings exceeded two standard errors of a
zero-order correlation for an N of the magnitude used
in this study (Fruchter, 1954, p. 79). Two criteria
were used for the rotational procedure: the varimax
criterion [illegible] requirement that the [illegible]


TABLE 2
ROTATED FACTOR MATRIX FOR 29 INDICES OF ACHIEVEMENT MOTIVATION

Measures | I | II | III | IV | V | VI | VII
1. GPA−ACE | 74 | 01 | 00 | 05 | −20 | 08 | 09
2. Aspiration-Grade Discrepancy^a (first test) | 30 | −07 | −32 | 30 | −05 | 10 | −14
3. Aspiration-Grade Discrepancy (last test) | 37 | −04 | −35 | −11 | −16 | −06 | −05
Adjective Checklist Items:
4. Ambitious | 25 | 09 | 33 | 36 | −22 | 20 | 26
5. Industrious | 28 | 13 | 42 | 43 | −32 | 17 | 09
6. Hardworking | 48 | 13 | 58 | 26 | −19 | −09 | 18
7. Studious | 56 | 07 | 36 | 40 | 04 | 04 | 16
8. Competitive | 11 | 34 | 08 | 45 | 11 | 01 | 08
9. Motivated | 44 | 15 | 28 | 60 | −05 | 03 | 10
Achievement Motivation Inventory Items:
10. I am more ambitious than the average person. | 32 | 04 | 23 | 29 | −04 | 31 | [illegible]
11. When faced with difficult tasks, I generally “give up” too easily. | −06 | 00 | −05 | −52 | 05 | 01 | 14
12. I [illegible] considerable effort to get good grades. | 37 | 12 | 57 | 03 | 06 | 28 | 09
13. I wish I were more successful as a student. | −18 | −33 | 23 | −20 | 23 | 06 | −29
14. I try very hard to do well in anything I undertake. | 29 | 06 | 18 | 05 | 12 | 54 | 12
15. I find that the standards I set for my own work are usually too high for me to attain. | 01 | −31 | 11 | −15 | 55 | 17 | [illegible]
16. My parents exert considerable pressure on me to achieve good grades. | −27 | 01 | 07 | 10 | 49 | 04 | −08
17. I have strong motivation to succeed in my work. | 49 | 03 | 40 | 17 | 20 | 38 | −17
18. I feel under constant pressure to do well in my school work. | 04 | 02 | 16 | −01 | 53 | 04 | [illegible]
19. Manifest Anxiety | 13 | −36 | −09 | 11 | 30 | −17 | 09
20. Sentence Completion | 09 | −03 | 33 | 26 | 14 | 34 | 07
Multiple-Choice Questionnaire Items:
21. Do you feel that thus far you have really lived up to your parents' expectations for you? | 33 | 35 | 07 | 20 | −41 | 01 | −17
22. Are your parents satisfied with your grades in college? | 59 | 27 | 04 | [illegible] | [illegible] | [illegible] | [illegible]
23. How hard do you work on your schoolwork in comparison with the typical student of your sex? | 75 | 05 | 44 | 00 | 05 | 02 | −08
24. How important are grades to you? | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible]
25. Do you feel that your parents are satisfied with you as a person? | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible]
26. Are you satisfied with yourself as a person? | −01 | 69 | −04 | 20 | [illegible] | [illegible] | [illegible]
27. Do you and your parents agree on “values”: what constitutes the really important things in life? | −03 | 48 | 24 | −22 | 11 | 01 | 24
28. How would you rate your [illegible] in comparison with the typical student of your sex? | [illegible] | [illegible] | 13 | 07 | 10 | 18 | [illegible]
29. TAT n Achievement | −01 | −02 | −09 | 05 | 00 | −06 | −41

^a Direction of scoring reversed by subtracting each discrepancy from a constant; the higher scores represented the smaller discrepancies.


[illegible] that had to be made to meet the
requirements of the second. The following factors were
tentatively identified.
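The interpretation rule stated for the factors (the product of the two highest loadings must exceed two standard errors of a zero-order correlation) can be sketched as below. Several points are assumptions rather than details given in the text: the standard error is taken as 1/sqrt(N − 1), the loadings are compared by absolute value, and N = 100 is a hypothetical sample size chosen only for illustration. The loadings are a selection from columns I and VII of Table 2.

```python
from math import sqrt


def se_zero_order_r(n):
    """Approximate standard error of a zero correlation (assumed form)."""
    return 1.0 / sqrt(n - 1)


def is_interpretable(loadings, n):
    """True if the product of the two largest absolute loadings
    exceeds two standard errors of a zero-order correlation."""
    top_two = sorted((abs(v) for v in loadings), reverse=True)[:2]
    return top_two[0] * top_two[1] > 2 * se_zero_order_r(n)


N = 100  # hypothetical sample size, not the study's N

# Selected loadings from Table 2, written as decimals.
factor_i = [0.74, 0.30, 0.37, 0.56, 0.59, 0.75]
factor_vii = [0.09, -0.14, 0.26, -0.29, -0.17, -0.41]

print(is_interpretable(factor_i, N))    # True
print(is_interpretable(factor_vii, N))  # False
```

Under these assumptions the sketch agrees with the treatment of Factor VII as a residual factor: its two largest loadings, .41 and .29, give a product of about .12, which falls short of the threshold of about .20.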

Factor I had its highest loadings for 
the GPA—ACE criterion and for two 
variables (23 and 28) which reflect the 
actual effort expended to do an effec- 
tive academic job. Other variables 
have loadings to the extent that they 
also reflect an earnest application of 
effort to obtain tangible academic re- 
sults. This factor might best be called 
Academic Motivation and Efficiency.
It is interesting to note that the varia- 
bles having the highest loadings on 
this criterion factor were objective 
self-rating indices, that the level of 
aspiration measures had moderate 
loadings on the factor, that the two 
projective measures (20 and 29) did 
not have loadings on this factor, and 
that neither manifest anxiety (19) nor 
self-satisfaction (26) had loadings on
this important criterion factor.

Factor II had its two highest loadings for items
relating to self-satisfaction and acceptance (25 and
26). It has negative loadings for manifest anxiety
(19) and for the item expressing a desire to be more
successful as a student (13). It seems appropriate to
refer to this factor as Self-Satisfaction. It is
interesting to note the lack of relationship between
this Self-Satisfaction factor and the GPA−ACE
criterion.

Factor III can best be interpreted in terms of its
contrast with Factor I. It has no loading for the
GPA−ACE criterion. It has negative loadings for the
level of aspiration measures, indicative of its
correlation with larger discrepancies between
aspiration and achievement, as contrasted with the
positive loadings of the first factor. It has a
positive loading for the item expressing the desire
for more success as a student, in contrast to the
first factor's negative loading for the same item
(13). Its loading of .04 contrasts with the first
factor loading of .59 for the item expressing parental
satisfaction with the subject's grades (22). Its much
smaller loadings for Items 23 and 28, earlier
interpreted as reflecting actual willingness to expend
effort to achieve results, must be contrasted with the
corresponding very high loadings for the first factor.
For most of the remaining variables there seems to be
a similar tendency for low loadings to be associated
with items referring explicitly to effective
application to academic work, while the items with
high loadings are more vague and generalized, less
behavior oriented, and more susceptible to a positive
response set. All the evidence, and particularly that
already noted for Items 1, 2, 3, and 18, suggests that
Factor III should be interpreted as Wish-Fulfillment
Motivation. The factor seems to represent strong
desire but little application or fulfillment.
It is interesting to compare the third factor loadings
of the sentence completion test with its first factor
loadings. The resulting contrast strongly suggests
that projective test users must avoid jumping to
conclusions about what level of experience or
motivation is being tapped by a projective test. The
appreciable loading of the sentence completion test on
the Wish-Fulfillment factor, its lack of loading on
the first factor, and the earlier reported correlation
of −.32 between sentence completion and ACE, all
suggest that this projective may tend to [illegible]
the less able student who [illegible] for high
achievement but who, because of discouragement or
inertia, cannot really buckle down to achieve
[illegible]. This hypothesis also receives [illegible]
confirmation from the original correlations
[illegible]

Factor IV has a high positive loading for the
adjective “motivated” (9) and a high negative loading
for the inventory item indicating the subject's
tendency to “give up” with difficult tasks (11).
Several lesser loadings represent similarly
achievement oriented variables. What most
distinguishes this factor, however, is its lack of
loadings for all items related strictly to academic
motivation alone. Items having no loadings on this
factor include two items concerned with willingness to
work hard on schoolwork (12 and 23), one concerned
with the importance of grades to the subject (24), and
one concerned with the effectiveness of the subject's
study habits (28). Because of this specific exclusion
of academic motivation from what seems to be a
generally positive achievement orientation, it seems
logical to designate this factor as Nonacademic
Achievement Motivation. Appropriately enough, this
factor has no loading for the GPA−ACE criterion.

Factor V is notable for its high loadings for items
concerned with feeling under parental pressure to do
good academic work (16 and 18), aiming at standards
that are too high to attain (15), but still not living
up to parental expectations or achieving grades
satisfactory to them (21 and 22). This factor seems
most clearly to define an External Pressure to
Achieve.

Factor VI, the most difficult factor to interpret, has
a high loading for an item referring to the subject's
willingness to try hard in all he undertakes (14) and
lesser but still appreciable loadings for items
reflecting a generalized desire to achieve (10, 12,
and 17). Yet it is quite unrelated to the GPA−ACE
criterion. Unlike the third factor, it has no loading
for the level of aspiration measures (2 and 3) or for
the item reflecting a desire for more success as a
student (13). Tentatively we will call this factor
Imputed Generalized Motivation without Attendant
Effort. It is interesting to note that the sentence
completion test again loads on a factor that involves
desire to achieve but little evidence of real or
appropriately directed effort.

Factor VII should probably be in- 
terpreted as a residual or error factor 
of no importance. The product of its 
two highest loadings did not exceed 
the two standard error criterion. Its 
highest loading was for the TAT n 
Achievement test, whose appearance 
on this error factor is most probably 
due to low reliability. This same test 
had but one significant correlation in 
the entire battery, and this was low 
and perhaps attributable to chance. 

An analysis of this type would seem 
to have several implications requiring 
the thoughtful consideration of re- 
searchers in this area. First, it is evi- 
dent that the achievement motivation 
construct is not a unitary construct 
with invariable meaning, and that any 
attempt to regard it in this light is 
likely to lead to methodological con-
fusion, conflicting results, and concep-
tual stagnation. In this study it was
possible to isolate five or six relatively 
independent dimensions which repre- 
sented different and psychologically 
significant aspects of the desire to 
achieve. It would therefore seem im- 
perative for each research worker in 
this area to offer some empirical evi- 
dence of which of several possible va- 
rieties of achievement motivation he 
had chosen to employ in his investiga- 
tion. To ignore the complexity of the 
achievement motivation construct is 
to render many results uninterpreta- 
ble. 

Secondly, it is apparent that tests and test items
showing equivalent initial promise may turn out to
differ widely in their ability to measure any given
aspect of achievement motivation. Empirical surprises
seem almost the rule rather than the exception. The
TAT n Achievement test had the lowest communality of
all 29 variables employed in the factor analysis, and
it loaded only on an error factor. Its
one significant correlation in the origi- 
nal correlation matrix was a negative 
correlation with an item relating to 
ambition, and its correlation with the 
GPA—ACE criterion was —.07. On the 
other hand, two individual objective 
items (23 and 28) had very high load- 
ings on the first factor and were cor- 
related .53 and .52, respectively, with 
the GPA—ACE criterion. This is a 
notable result, not only because such
correlations are difficult to achieve
even with whole tests, but also be-
cause they were achieved with objec-
tive, multiple-choice, self-rating items
that focus on concrete, easily observed 
behaviors that are a part of everyday 
experience. 

Thirdly, tests and test items may differ widely in
terms of which particular aspects of the desire to
achieve complex they actually measure. For example,
the sentence completion test had a .09 loading on the
first factor, a correlation of .09 with the GPA−ACE
criterion, and significant correlations of −.32 and
−.17 for the ACE and GPA, respectively. There is
certainly no evidence of its validity as a measure of
achievement motivation here, and if GPA had been used
as a criterion, results would be directly contrary to
hypothesis. But the complete analysis revealed that
this test was loaded on the Wish-Fulfillment factor,
and its loading there gave credence to Lazarus'
(Lazarus et al., 1957) contention that there is no
one-to-one relationship between the expression of a
need in fantasy and its actualization in behavior.
Indeed, the fantasy may even serve as a substitute for
overt striving. Analyses of fantasy, then, may not
always be “depth” analyses; they may actually reveal
relatively superficial aspects of behavior and
personality.

Certain objective items also turned out to measure
quite different aspects of the desire to achieve
complex than had been originally anticipated. It had
been anticipated, for instance, that Items 8, 11, and
18 would be highly related to the criterion. As it
developed, they were not at all related to the
criterion nor to the first factor; they were, however,
loaded on other factors. Items 22 and 28, on the other
hand, were highly related to the GPA−ACE criterion,
having correlations of .50 and .52, respectively, but
their patterns of loadings in the factor matrix were
quite different. Here, too, cautious analysis is
needed to ascertain just what aspect of the desire to
achieve complex is actually being measured.
The importance of such analysis is particularly
apparent in comparing the lack of correlation between
some of the instruments represented in Table 1 with
the relatively high loadings of the factor matrix and
the relatively high interitem correlations of the
original correlation matrix. A comparison of this kind
furnishes compelling evidence that intertest
correlations can be attenuated by items representing
different factorial dimensions or by same-test items
actually counteractive in their influence.
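The attenuation described here follows from the standard formula for the correlation between an unweighted sum of standardized items and a standardized criterion. The sketch below uses invented item correlations, not values from the study, and a hypothetical function name; it shows how adding an item from an unrelated dimension, or one that works against the criterion, lowers the total-score correlation below that of the single best item.

```python
from math import sqrt


def composite_criterion_r(item_criterion_rs, item_intercorrelations=None):
    """Correlation of an unweighted sum of standardized items with a
    standardized criterion.

    item_criterion_rs: correlations of each item with the criterion.
    item_intercorrelations: {(i, j): r} for item pairs with i < j;
    pairs not listed are treated as uncorrelated.
    """
    inter = item_intercorrelations or {}
    k = len(item_criterion_rs)
    covariance = sum(item_criterion_rs)
    variance = k + 2 * sum(
        inter.get((i, j), 0.0) for i in range(k) for j in range(i + 1, k)
    )
    return covariance / sqrt(variance)


# One item correlating .50 with the criterion, taken alone.
print(round(composite_criterion_r([0.50]), 3))
# The same item summed with an item from an unrelated dimension.
print(round(composite_criterion_r([0.50, 0.00]), 3))
# The same item summed with a counteractive item.
print(round(composite_criterion_r([0.50, -0.30]), 3))
```

With these illustrative figures the correlation drops from .50 to roughly .35 and then to roughly .14, although one item in the total score is as valid as before.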
Fourthly, it is not only important to acknowledge that
different tests and test items may represent different
aspects of the desire to achieve complex; it is
equally important to recognize that the same test or
test item may represent several different aspects of
the desire to achieve complex. The adjective
“industrious” (9), for instance, has loadings on four
different factors [illegible] four distinctly
[illegible] attitudes toward achievement. [illegible]
are several other items [illegible] loaded
[illegible]. It is apparent that identical responses
can have very different meanings for different
subjects.


[illegible] there were certain by-products of this
investigation that are worthy of note despite their
peripheral [illegible]. The pattern of loadings for
manifest anxiety did not lend confirmation to the
hypothesized drive attributes of that variable, but it
must be acknowledged that this investigation was not
designed as a critical test of [illegible]. It is also
interesting to note that the item relating to
self-satisfaction (25) was correlated .00 with
[illegible] criterion and .09 with [illegible]. The
Self-Satisfaction factor [illegible] similarly
insignificant and negligible loadings for items
concerned with strictly scholastic motivation (7,
[illegible], 24). Apparently the self-[illegible] of
our college students are not nearly as influenced by
matters relating to [illegible] motivation and
[illegible] are sometimes inclined to [illegible]


SUMMARY

The purpose of this study was to determine whether
achievement motivation was a unitary construct with
invariable meaning or a complex of relatively
independent dimensions with different implications and
different empirical correlates. Twenty-nine indices of
achievement motivation were subjected to factor
analysis. Six factors were identified: Academic
Motivation and Efficiency (the only factor [illegible]
predictive of actual academic performance),
Wish-Fulfillment Motivation, Nonacademic Achievement
Motivation, Self-Satisfaction, External Pressure to
Achieve, and Imputed Generalized Motivation without
Attendant Effort. In addition to demonstrating that
there are several varieties of achievement motivation,
the results also indicated that: tests and test items
differed widely in their ability to measure any of the
several facets of this complex variable, identical
responses could have very different meanings for
different subjects, self-rating procedures excelled
projective methods in predicting actual academic
performance, and the projectives were either loaded on
wish-fulfillment or error factors.


REFERENCES 


Atkinson, J. W. Motives in fantasy, action,
and society: A method of assessment
and study. Princeton: Van Nostrand,
1958.

Atkinson, J. W. Personality dynamics.
Annu. Rev. Psychol., 1960, 11, 255-290.

Bendig, A. W. Manifest anxiety and projective
and objective measures of need
achievement. J. consult. Psychol., 1957,
21, 354.

Bills, R. E., Vance, E. L., & McLean, O. S.
An index of adjustment and values. J.
consult. Psychol., 1951, 15, 257-261.

Birney, R. C. The achievement motive and
task performance. J. abnorm. soc. Psychol.,
1958, 56, 133-135.

Blake, R. R., & Mouton, J. S. Personality.
Annu. Rev. Psychol., 1959, 10, 203-232.

Fruchter, B. Introduction to factor analysis.
New York: Van Nostrand, 1954.

Himelstein, P., Eschenbach, A. E., & Carp,
A. Interrelationships among three measures
of need achievement. J. consult.
Psychol., 1958, 22, 451-452.

Kaiser, H. F. The varimax criterion for
analytic rotation in factor analysis.
Psychometrika, 1958, 23, 187-200.

Lazarus, R. S., Baker, R. W., Broverman,
D., & Mayer, J. Personality and psychological
stress. J. Pers., 1957, 25, 559-577.

McClelland, D. C., Atkinson, J. W.,
Clark, R. A., & Lowell, E. L. The
achievement motive. New York:
Appleton-Century-Crofts, 1953.

Melikian, L. H. The relationship between
Edwards' and McClelland's measures of
achievement motivation. J. consult.
Psychol., 1958, 22, 296-298.

Taylor, J. A. A personality scale of manifest
anxiety. J. abnorm. soc. Psychol., 1953,
48, 285-290.

Thurstone, L. L. Multiple factor analysis.
Chicago: Univer. Chicago Press, 1947.

Vogel, W., Baker, R. W., & Lazarus, R. S.
The role of motivation in psychological
stress. J. abnorm. soc. Psychol., 1958,
56, 105-112.


(Received December 30, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 4, 188-194 


CATEGORIZATION AND RELATED VERBALIZATIONS 
IN DEAF AND HEARING ADOLESCENTS¹
SOLIS L. KATES, WILLIAM W. KATES, JAMES MICHAEL,
AND TERRENCE M. WALSH


Clarke School for the Deaf, Northampton, Massachusetts, 
and University of Massachusetts 


Studies dealing with the perceptual 
and conceptual capacities of deaf chil- 
dren have suggested that children deaf 
from birth fail to develop the ability 
to deal with abstract material in a 
fashion comparable to hearing chil- 
dren (Larr, 1955; Myklebust, 1953; 
Oleron, 1950; Templin, 1950). Other 
studies, however, have shown that the 
differences between the thinking proc- 
esses of deaf and hearing children do 
not correspond to the differences be- 
tween the concreteness of the aphasics 
and the abstract functioning of normal 
subjects (Oleron, 1953) but correspond 
much more closely to the differences 
between a younger and older group of 
hearing children (Clarke School, 
1940). When the level of verbalization 
required is within the range of vocabu-
lary and experience of the deaf sub-
ject, deaf subjects perform as well as
hearing subjects on abstract as well as 
less abstract tasks (Rosenstein, 1959). 

There are three major purposes of 
this experiment. First, we are inter- 
ested in investigating a specific aspect 
of the cognitive processes of deaf and 
hearing children, that of categorization.
Categorization, “to render discriminably different
things equivalent,” forms one of the bases for all
other cognitive activities (Bruner, Goodnow, & Austin,
1956, p. 1). Our second major concern is in
distinguishing between the process of categorization
on the one hand, and the process of verbalization on
the other, [illegible] judge the adequacy of
categorization independently of the adequacy of
verbalization. The process of learning may involve
both learning a new verbal attribute and then learning
the nonverbal category that goes with it as well as
learning a verbal attribute to go with a nonverbal
category already formed (Brown, 1958). Our third
concern is in attempting to separate the effects of
deafness from the cognitive effects of age and
achievement [illegible]

¹ This [illegible] supported in part by Grant
[illegible] Office of Vocational Rehabilitation,
United States Department of Health, Education, and
Welfare.

The general hypothesis of this study was: There will
be no differences in categorization or verbalization
found for one comparison of deaf subjects with hearing
subjects of matched age and IQ that will also be found
for comparison of these deaf subjects with hearing
subjects matched on school achievement and IQ.


METHOD

Subjects. One group consisted of five girls and three
boys of the Clarke School for the Deaf in Northampton,
Massachusetts, representing the total number of
graduating [illegible] with very profound hearing
loss. [illegible] was by the oral method of
instruction. Their average IQ on the WAIS (both verbal
and performance subscales) [illegible]. On the
Stanford Achievement [illegible] was equivalent to the
7.68 grade [illegible] average IQ, based on WISC and
Terman-McNemar, was 104.63; SD, [illegible] was 14.2
years.

[illegible] included eight high school seniors, five
girls and three boys. Their average age was 17.97
years; SD, 0.50. Their average IQ, based on WAIS and
Terman-McNemar, was 106.38; SD, 5.50. These subjects
were achieving at an average level for their high
school class, and were expected to graduate without
difficulty. The high school and junior high school
students were [illegible] in consultation with their
teachers [illegible] no subject was included
[illegible] personality and physical [illegible]

There were no significant differences between the deaf
and the junior high school [illegible] level and IQ,
and, similarly, there were no significant differences
between the deaf and the high school [illegible] age
and IQ. The size of the sample [illegible] it
necessary for this study to [illegible]

Categorization and Verbalization Tests. The
Goldstein-Scheerer (1941) Object Sorting Test was
individually administered. This test consists of 33
everyday objects including such items as a real cigar,
a real pair of pliers, [illegible], etc. (Rapaport,
Gill, & Schafer, 1945). The Object Sorting Test was
administered in three parts: the free, active, and
compliant sections. In the free sorting section the
subject was asked to put all the objects into any
number of groups and then tell why the objects in each
group belonged together. In the active section, the
subject [illegible] preselected groupings of objects
and [illegible] each subject to give the reasons for
[illegible] sections in order. [illegible] were used
for the [illegible] hearing subjects. [illegible] on
the part of the experimenter [illegible]
verbalizations of all subjects [illegible]

Scoring. The scoring system for categorizations was
based partially on that used by Rapaport et al. (1945,
pp. 401-406). Classification of verbalizations was
based upon those used by Rapaport as well as upon
criteria for types of rules of verbalization set up by
Bruner et al. (1956, pp. 4-7). Two independent scorers
agreed on 95% of a sample of 350 items scored
independently. Copies of the complete scoring system
are available upon request.

Statistical Procedure. Our statistical anal-
ysis compared separately the deaf subjects
with each one of the two control groups; the 
question of the overall differences among 
the three groups was not a concern of this 
study. 

To make some data comparable from 
subject to subject, proportions were calcu- 
lated for measures such as adequacy of 
categorization and verbalization. Estimate 
tests of homogeneity of variance were per-
formed and heterogeneous data were con-
verted to T scores (Edwards, 1954). All
variances were then found to be homo- 
geneous. 


RESULTS AND DISCUSSION


Prediction 1. The deaf subjects will 
have a similar number of divisions and 
categories as the hearing subjects 
matched on achievement and IQ but 
significantly more categories and di- 
visions than the hearing subjects 
matched on age and IQ. Furthermore,
the deaf subjects will not differ in the 
number of failures from either hear- 
ing group. Divisions are defined as the 
number of groupings and/or single ob- 
jects that the 33 objects are divided 
into in the free sorting section. The 
number of categories is equal to the 
number of groupings with two or more 
nonidentical objects included. A fail- 
ure consists of an object that is not 
grouped with another object or of an 
absolute identity class such as two 
sugar cubes. 
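
The three counts defined above can be illustrated with a short sketch. It is only one reading of the definitions, not the authors' scoring system: it assumes each free sort is recorded as a list of groups, that identical objects (such as the two sugar cubes) carry the same name, and that every single-object or identity grouping counts as one failure. The function name and the example objects other than the cigar, pliers, and sugar cubes are hypothetical.

```python
def score_free_sort(groups):
    """Count divisions, categories, and failures for one free sort.

    groups: list of lists of object names (assumed recording format);
    identical objects are assumed to share the same name.
    """
    # Divisions: every grouping or single object the subject produced.
    divisions = len(groups)
    # Categories: groupings holding two or more nonidentical objects.
    categories = sum(1 for g in groups if len(set(g)) >= 2)
    # Failures (assumed counting rule): ungrouped objects and
    # identity classes such as two sugar cubes.
    failures = sum(1 for g in groups if len(set(g)) < 2)
    return divisions, categories, failures


example_sort = [
    ["hammer", "pliers", "screwdriver"],   # a category
    ["sugar cube", "sugar cube"],          # identity class
    ["cigar"],                             # ungrouped object
    ["fork", "spoon"],                     # a category
]
divisions, categories, failures = score_free_sort(example_sort)
print(divisions, categories, failures)
```

The source does not say exactly how identity classes enter the failure count, so the means in Table 1 need not satisfy divisions = categories + failures as this sketch does.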

Results. The results, in general, sup- 
port this prediction. Deaf subjects 
have the same number of divisions, 
categories, and failures as both groups 
of hearing subjects (Table 1), and 
thus appear as effective as hearing 
subjects in the two control groups in 
reducing the complexity of this ex- 
perimental world. 

Prediction 2. The deaf subjects will 
categorize as adequately but will not 




TABLE 1
t TEST RESULTS FOR COMPARISONS BETWEEN DEAF AND CONTROL GROUPS
MATCHED ON AGE AND IQ (C1) AND ON STANFORD ACHIEVEMENT AND IQ (C2)

Comparison                        Group    M        SD       t^a
Divisions                         C1       8.75     1.79     0.298
                                  Deaf     9.00     1.32
                                  C2       7.88     1.60     1.460
Categories                        C1       7.13     1.05     0.182
                                  Deaf     7.25     1.39
                                  C2       6.50     1.22     1.070
Failures                          C1       2.13     1.97     0.00
                                  Deaf     2.13     0.93
                                  C2       2.25     1.92     0.155
Concrete verbalization^b          C1       0.080    0.078    1.418
                                  Deaf     0.158    0.123
                                  C2       0.212    0.115    0.857
Functional verbalization^b        C1       0.214    0.222    0.404
                                  Deaf     0.176    0.113
                                  C2       0.275    0.123    1.571
Category width with failures      C1       5.00     1.16     2.239*
                                  Deaf     3.65     1.10
                                  C2       4.13     0.72     0.968
Category width without failures   C1       5.20     1.03     2.480*
                                  Deaf     3.79     1.09
                                  C2       4.51     0.93     1.324
Formal verbalizations used        C1       52.33    11.07    0.031
  inadequately^c                  Deaf     52.18    6.15
                                  C2       47.87    5.88     1.340
Inclusive verbalization^d         C1       47.35    9.18     0.898
                                  Deaf     52.33    11.45
                                  C2       50.37    7.42     0.380

a The t value is the result of the comparison made of the deaf
group mean with the control group mean just opposite the
corresponding t value.

b Raw scores converted to proportion of total number of
verbalizations.

c Proportion of formal inadequate to total inadequate converted
to T scores.

d Proportion of inclusive verbalization to total number of
verbalizations converted to T scores.

* Significant at p < .05 level.


have as many adequate verbalizations 
as the hearing subjects matched on age 
and IQ. Furthermore, the deaf sub- 
jects will have the same number of 




adequate verbalizations as the group matched on achievement and
IQ. An adequate categorization is one in which all the objects
included are relevant to each other independent of any
verbalization, and there are no relevant objects present in the
sample that are not included in the categorization. An adequate
verbalization is defined as one that covers correctly the
objects sorted. An inadequate verbalization may be false, or may
not take into account all the objects included in the
categorization, or may take into account all objects included in
the categorization but pertain to objects outside the
categorization for which the verbalization is given.

Results. The results support this prediction. Analyses of
variance performed on the proportion of adequate categorizations
to the opportunities for categorization show that there are no
differences between the deaf subjects and either of the two
hearing groups (Tables 2 and 3). When the task of grouping
equivalent objects together is judged independently of the
verbalization of the reason for that grouping, deaf subjects
perform as well as hearing subjects of their own age and IQ.
Furthermore, the deaf subjects have significantly fewer adequate
verbalizations than the hearing group matched on age and IQ
(F = 6.58; p < .05) but there are no differences between the
deaf group and the hearing group matched on achievement and IQ
(Tables 2 and 3). Thus, in terms of adequacy of verbalization,
the deaf subjects seem to resemble the younger hearing subjects.


Prediction 3. The deaf subjects will have a greater number of
adequate categorizations accompanied by inadequate
verbalizations than the control group matched on age and IQ
while not differing from the hearing group matched on
achievement and IQ.




Results. The analyses were conducted on standard scores
converted to T scores (Tables 2 and 3). The results indicate
that the deaf subjects have a significantly greater number of
adequate categorizations accompanied by inadequate
verbalizations than the control subjects matched on age and IQ
(F = 5.47; p < .05). But there were no differences between the
deaf subjects and those matched with the deaf on achievement and
IQ. The results support our prediction. It may be pointed out
that the younger control group had significantly more adequate
sortings accompanied by inadequate verbalizations than the older
control group. Our results suggest that deafness holds back the
process of correctly relating verbal attributes to nonverbal
categories but that this retardation is similar to that shown by
younger hearing subjects when they are compared to older hearing
subjects.


TABLE 2
MEANS AND STANDARD DEVIATIONS OF THE DEAF GROUP AND THE TWO
CONTROL GROUPS

Variable                            Group    M          SD
Adequacy of categorization          Deaf     0.713^a    0.200
                                    C1^b     0.755      0.187
                                    C2^c     0.674      0.149
Adequacy of verbalization           Deaf     0.624^d    0.224
                                    C1       0.774      0.185
                                    C2       0.582      0.197
Adequate categorization -           Deaf     52.48^e    8.86
  inadequate verbalization          C1       45.32      6.54
                                    C2       52.83      8.25
Formal generic verbalization        Deaf     52.70^e    8.92
                                    C1       51.99      9.54
                                    C2       45.37      8.95
Total formal verbalization          Deaf     51.55^e    8.90
                                    C1       52.53      10.40
                                    C2       45.82      8.08
Proneness to change                 Deaf     48.63^e    8.56
                                    C1       52.05      9.80
                                    C2       50.65      7.74

a Proportion of the number of adequate categories to total
opportunities for categorization.
b C1 is the control group matched with the deaf on age and IQ.
c C2 is the control group matched with the deaf on Stanford
Achievement and IQ.
d Proportion of the number of adequate verbalizations to the
number of categorizations.
e Converted to T scores.


TABLE 3
ANALYSIS OF VARIANCE RESULTS FOR COMPARISONS BETWEEN DEAF AND THE
CONTROL GROUP MATCHED ON AGE AND IQ (C1) AND ON STANFORD
ACHIEVEMENT AND IQ (C2)

Variable                               df    SS         MS        F
Adequacy of categorization^a
  C1  Between treatment groups         1     0.014      0.014     0.280
      Error                            14    0.699      0.050
  C2  Between treatment groups         1     0.013      0.013     0.317
      Error                            14    0.573      0.041
Adequacy of verbalization^b
  C1  Between                          1     0.270      0.270     6.585*
      Error                            14    0.573      0.041
  C2  Between                          1     0.021      0.021     0.438
      Error                            14    0.677      0.038
Adequate categorization -
  inadequate verbalization^c
  C1  Between                          1     410.06     410.06    5.474*
      Error                            14    1048.76    74.91
  C2  Between                          1     0.96       0.96      0.009
      Error                            14    1544.34    110.31
Formal generic verbalization^d
  C1  Between                          1     5.98       5.98      0.003
      Error                            14    3064.02    218.86
  C2  Between                          1     645.55     645.55    3.417
      Error                            14    2644.15    188.82
Total formal verbalization^e
  C1  Between                          1     11.59      11.59     0.619
      Error                            14    2622.46    187.32
  C2  Between                          1     385.90     385.90    3.078
      Error                            14    1755.12    125.37
Proneness to change^c
  C1  Between                          1     93.61      93.61     0.785
      Error                            14    1670.42    119.32
  C2  Between                          1     32.46      32.46     0.382
      Error                            14    1190.63    85.05

a Raw scores converted to proportion of adequate categorizations
to total number of categorizations.

b Raw scores converted to proportion of adequate verbalizations
to total number of verbalizations (number of divisions plus
[...] active plus 12 compliant).

c Raw scores converted to T scores.

d Proportions of formal generic verbalizations to total number
of verbalizations converted to T scores.

e Proportions of total formal verbalizations to total number of
verbalizations converted to T scores.

* Significant at p < .05 level.

Prediction 4. The deaf subjects will have more functional, more
total concrete, fewer formal, fewer formal generic
verbalizations than the control group matched on age and IQ but
the same number as the control group matched on achievement and
IQ. A functional verbalization specifies a rule of
categorization based upon the sharing of a common function.
Total concrete verbalizations include the syncretistic,
fabulatory, chain, symbolic, split narrow verbalization as
defined by Rapaport et al. (1945), and formal specific
verbalization type. Formal verbalizations are defined as
verbalizations specifying a rule of grouping based on the
sharing of an intrinsic property ("all red," or "all kitchen
utensils") among the objects in the group (Bruner et al., 1956).
A formal generic verbalization specifies a rule of grouping
based upon the sharing of highly complex and class-like
attributes ("all metal").

Results. There are no differences between the deaf and either of
the hearing control groups with respect to any of the
verbalization types discussed above (Tables 1, 2, and 3). This
prediction is not supported. However, the group matched with the
deaf on achievement and IQ had significantly more "total
concrete" verbalizations than the older control group. This
result indicates that the test does differentiate, at least in
regard to concrete verbalizations, levels of verbalization due
to age and achievement differences in hearing subjects. On the
whole these concrete verbalizations represent an inability to
utilize a conjunctive rule of grouping to explain a
categorization. Further, they can also be described as "private"
verbalizations (McGaughran & Moran, 1956). Deaf subjects on
these tasks are no more concrete verbally, do not demonstrate
any greater inability to utilize conjunctive rules of grouping,
and utilize no more private verbalizations than the two control
groups.

There is little evidence and theory concerning the following
issues, hence they are stated in the form of a question:

Are there any differences between the deaf subjects and either
control group in proneness to change, in category width, in
number of formal verbalizations used inadequately, and in the
number of too inclusive verbalizations? Proneness to change is
measured by the exclusion of objects in the active sorting which
were included with these target objects in free sorting or by
the inclusion, with the target objects in the active sorting, of
objects not included with them in the free sorting. Category
width is measured by the number of objects included with the
target objects in the active section, both counting and not
counting failures.


The data were converted to T scores and analysis showed that
there are no differences between the deaf and either of the
control groups in overall proneness to spontaneous change
(Tables 2 and 3). These results are in [...] that deaf subjects
are [...] than hearing children (McAndrew, 1948). [...] rigidity
of the deaf [...]

Our results show that the deaf subjects have narrower categories
than the control group matched on age and IQ but do not differ
in category width from the hearing subjects matched on
achievement and IQ (Table 1). The use of narrower categories by
the deaf subjects who, at the same time, have the same number of
categories as the control groups, may indicate a difficulty in
organizing the world and in generalizing rules of grouping to a
wide array of objects. This, of course, would also be true of
the control group of hearing subjects from whom the deaf did not
differ in category width. However, with respect to this small
array of familiar objects, the use of narrower categories might
be more appropriate and more predictive than the use of wider
categories. The problem of the relation of deafness to category
width seems to be one that deserves future attention.

The proportions of formal inadequate verbalizations and of
inclusive verbalizations to the total number of verbalizations,
respectively, were converted to T scores and were compared for
the three groups. The deaf did not differ significantly from
either control group in formal inadequate verbalizations and in
inclusive verbalizations, respectively (Table 1).


Summary 


This study has investigated the categorization and verbalization
processes of 24 adolescents, 8 of whom were profoundly deaf. The
remaining subjects [...] Stanford Achievement [...]
Goldstein-Scheerer [...] was administered and the deaf subjects
were separately compared with each of the two control groups.




Our conclusions are as follows: 

1. Deaf subjects categorize, on this 
task, as adequately as the hearing 
subjects. They have the same number 
of divisions, failures, and categories 
as the hearing subjects. On categoriza- 
tion tasks judged independently of 
verbalization, deaf subjects perform as 
well as hearing adolescents. 

2. Deaf subjects have more inade- 
quate verbalizations than hearing sub- 
jects and more adequate categoriza- 
tions accompanied by inadequate 
verbalizations than hearing subjects 
matched on age and IQ. These differ- 
ences are not true for the comparison 
of the deaf subjects and the hearing 
subjects matched on achievement and 
IQ. It is proposed that deafness retards 
learning of the correct relation be- 
tween the proper verbal attribute and 
the nonverbal category but that this 
retardation lies along a normal devel- 
opmental progression. 

3. Deaf subjects do not differ from 
hearing subjects in the type of verbali- 
zation or in the developmental level of 
the verbalizations used, or in their 
spontaneous changes of categorization 
throughout the test. 

4. Deaf subjects have narrower cate-
gories than hearing adolescents of
their own age and IQ though they do 
not differ from hearing subjects of the 
same achievement and IQ. 

5. There were no differences found 
between deaf and hearing subjects that 
indicated a definite shift in categori- 
zation ability distinct from the effects 
of age and achievement. 


REFERENCES 


Brown, R. Words and things. Glencoe, Ill.: Free Press, 1958.

Bruner, J. S., Goodnow, J. J., & Austin, G. A. A study of
thinking. New York: Wiley, 1956.

Clarke School for the Deaf, Clarence W. Barron Research
Department, Psychological Division. Studies in the psychology of
the deaf. Psychol. Monogr., 1940, 52(1, Whole No. 232).

Edwards, A. L. Statistical methods for the behavioral sciences.
New York: Rinehart, 1954.

Goldstein, K., & Scheerer, M. Abstract and concrete behavior: An
experimental study with special tests. Psychol. Monogr., 1941,
53(2, Whole No. 239).

Larr, A. L. An experimental investigation of the perceptual and
conceptual abilities of children in residential schools for the
deaf. Unpublished doctoral dissertation, Syracuse University,
1955.

McAndrew, H. Rigidity and isolation: A study of the deaf and the
blind. J. abnorm. soc. Psychol., 1948, 43, 476-494.

McGaughran, L. S., & Moran, L. J. "Conceptual level" vs.
"conceptual area" analysis of object-sorting behavior of
schizophrenic and nonpsychiatric groups. J. abnorm. soc.
Psychol., 1956, 52, 43-51.

Myklebust, R. Towards a new understanding of the deaf child.
Amer. Ann. Deaf, 1953, 98, 345-357.

Oléron, P. A study of the intelligence of the deaf. Amer. Ann.
Deaf, 1950, 95, 17[...]-195.

Oléron, P. Conceptual thinking of the deaf. Amer. Ann. Deaf,
1953, [...].

Rapaport, D., Gill, M., & Schafer, R. Diagnostic psychological
testing. Vol. 1. Chicago: Year Book Publishers, 1945.

Rosenstein, J. Perception and cognition in deaf children.
Unpublished doctoral dissertation, Washington University,
19[...].

Templin, M. The development of reasoning in children with normal
and defective hearing. Minneapolis: Univer. Minnesota Press,
1950.


(Received September 6, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 4, 195-198


CROSS-VALIDATION GROUPS, EXTREME GROUPS, AND
PREDICTION OF ACADEMIC ACHIEVEMENT


LEONARD J. KAZMIER 1
University of Notre Dame


In two parallel studies, scores on the variables of the Edwards
Personal Preference Schedule (EPPS) (Edwards, 1954) have been
compared for over- and underachieving college students (Gebhart
& Hoyt, 1958; Krug, 1959). In both studies, extreme over- and
underachievers were selected from a larger sample of male
college freshmen so that the middle range of achievers was
eliminated from the study. Accordingly, analyses of variance
were performed to test for the significance of differences
between the extreme groups, but validity coefficients were not
computed. Though the scales that were found to discriminate
between subjects who over- or underachieved in relation to
measured aptitude were not identical, the results of the two
studies did agree to a considerable extent. In the Gebhart and
Hoyt study, overachievers scored higher than underachievers on
the Achievement, Order, Intraception, and Consistency scales,
and significantly lower on the Nurturance, Affiliation, and
Change scales. In the study by Krug, overachievers scored
significantly higher on the Achievement, Order, and Endurance
scales and significantly lower on Affiliation and
Heterosexuality. The [...] as significant [...]. There has come
to be [...] about the [...] extreme groups in [...] such tests
of significance [...] and may represent only [...] of
relationship between the two variables. The primary purpose of
the present study was to investigate the relationship between
the EPPS scales and a criterion of academic achievement when
scores of all subjects are utilized, not just those of selected
extreme groups. The correlation technique was used throughout
because it provides a measure of the degree of relationship as
well as its significance. To ascertain that the measured
relationships of the several scales with the criterion are not
due to chance factors, a cross-validation group was included in
the study.

1 Research conducted while the author was at Ohio State
University.

PROCEDURE 


The sample consisted of 140 students en- 
rolled in two sections of introductory psy- 
chology at Ohio State University during the 
Spring Quarter of 1960. Both sections were 
taught by the same instructor, the author of 
the present report. For use as a predictor,
six paragraphs of material, including a total
of 36 questions on the contents thereof, 
were chosen from old forms of the Ohio 
State Psychological Examination (Buros, 
1959, pp. 494-495). The criterion for success 
in the psychology course was the total num- 
ber of points accumulated at the end of the 
quarter from three departmental examina- 
tions and four class quizzes. 

In order to achieve comparability of the 
experimental and cross-validation groups, 
all class cards for the subjects were segre- 
gated by sex, and then categorized accord- 
ing to college year. Within the college year 
the cards were arranged in order of per- 
centile scores on the Ohio State Psycho- 
logical Examination. Seven students for 
whom these scores were not available were 
arbitrarily placed at the end of their respec- 
tive college-year groups, after the highest 
percentile scores. Then the overall sample 
was subdivided into two groups of 70 sub-
jects each by alternately putting one card
in the experimental group and the following
card in the cross-validation group. Com-
parability of the two groups in terms of
their standing on the reading comprehension
test, the criterion, and the 15 EPPS scales
was ascertained.
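
The card-sorting procedure just described can be sketched in a few lines. This is an illustration under assumptions, not the author's actual bookkeeping: the dictionary keys (`sex`, `year`, `percentile`) and the function name are hypothetical, and a missing percentile is represented by `None` and placed after the highest score within its college-year group, as the text describes.

```python
def alternate_split(cards):
    """Order class cards by sex, college year, and percentile,
    then deal them alternately into two groups."""
    def sort_key(card):
        missing = card["percentile"] is None
        # Missing scores sort after all available scores in the
        # same sex and college-year block.
        return (card["sex"], card["year"], missing,
                0 if missing else card["percentile"])

    ordered = sorted(cards, key=sort_key)
    experimental = ordered[0::2]       # 1st, 3rd, 5th, ... card
    cross_validation = ordered[1::2]   # 2nd, 4th, 6th, ... card
    return experimental, cross_validation


# Hypothetical cards, for illustration only.
cards = [
    {"id": 1, "sex": "F", "year": 1, "percentile": 80},
    {"id": 2, "sex": "F", "year": 1, "percentile": None},
    {"id": 3, "sex": "F", "year": 1, "percentile": 20},
    {"id": 4, "sex": "M", "year": 2, "percentile": 50},
    {"id": 5, "sex": "M", "year": 1, "percentile": 90},
    {"id": 6, "sex": "M", "year": 1, "percentile": 10},
]
experimental, cross_validation = alternate_split(cards)
experimental_ids = [c["id"] for c in experimental]
cross_validation_ids = [c["id"] for c in cross_validation]
print(experimental_ids, cross_validation_ids)
```

Dealing alternately from an ordered deck keeps the two halves close on every sorting variable, which is why the comparability check in Table 1 is expected to show small differences.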

The correlation of each of the 15 scales
with the criterion of success was determined
for each of the groups and for the entire
sample, and the Wherry-Doolittle test se-
lection technique (Garrett, 1958, pp. 426-
440) was then applied to the experimental
group of subjects in order to determine the
best selection and weighting of EPPS scales
for maximum predictiveness. The applica-
bility of this result to the cross-validation
group was investigated by computing a
composite r for that group by using the
scales and weights derived from the experi-
mental group.
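
A composite r of this kind can be sketched as follows. The source does not give the computing formula, so this version is an assumption: it works from raw scores, standardizes each selected scale within the cross-validation group, applies the weights derived elsewhere, and correlates the weighted sum with the criterion. All function names and the small data set are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def standardize(x):
    """Convert scores to z scores (population SD)."""
    n = len(x)
    m = sum(x) / n
    sd = sqrt(sum((a - m) ** 2 for a in x) / n)
    return [(a - m) / sd for a in x]


def composite_r(scales, weights, criterion):
    """Correlate a weighted sum of standardized scales with the criterion."""
    z = [standardize(s) for s in scales]
    composite = [sum(w * col[i] for w, col in zip(weights, z))
                 for i in range(len(criterion))]
    return pearson_r(composite, criterion)


# Hypothetical scores for five subjects on two scales and a criterion.
scale_a = [1, 2, 3, 4, 5]
scale_b = [3, 1, 4, 1, 5]
criterion = [2, 1, 4, 3, 5]
print(composite_r([scale_a, scale_b], [0.5, -0.2], criterion))
```

When the weights come from a different group, nothing forces the composite to correlate positively with the criterion, so a value near zero or below it is a possible outcome.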


RESULTS


The split-half reliability coefficient 
(odd vs. even) for the reading com- 
prehension test is .813, while the coef- 
ficient of correlation between this pre- 
dictor and the criterion of total points 
in the psychology course is .593, which
is significant at the .001 level for the 
140 subjects involved in this study. 
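
The significance statement for this coefficient can be checked with the usual t transformation of a product-moment correlation. The source does not say which test was used, so the formula below, t = r * sqrt(N - 2) / sqrt(1 - r^2) with N - 2 degrees of freedom, is an assumption; the function name is hypothetical.

```python
from math import sqrt


def t_for_correlation(r, n):
    """t statistic for testing a correlation against zero (df = n - 2)."""
    return r * sqrt(n - 2) / sqrt(1 - r ** 2)


# Predictor-criterion correlation and sample size reported in the text.
t_value = t_for_correlation(0.593, 140)
print(round(t_value, 2))
```

With 138 degrees of freedom the two-tailed .001 critical value of t is a little above 3.3, so a t of this size is consistent with the reported level of significance.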


TABLE 1
COMPARISONS BETWEEN THE EXPERIMENTAL AND CROSS-VALIDATION GROUPS

                               Experimental       Cross-Validation
                               (N = 70)           (N = 70)
Measure                        M        SD        M        SD        F
Points in psychology course    259.00   39.25     260.10   36.05     -^a
Predictor score                17.18    6.22      16.84    5.93      -
EPPS scales:
  Achievement                  13.67    3.94      14.23    4.75      -
  Deference                    11.41    3.71      12.21    3.67      1.64
  Order                        9.44     4.17      10.67    3.90      3.94
  Exhibition                   14.34    3.33      14.34    3.65      -
  Autonomy                     12.86    4.20      12.66    5.04      -
  Affiliation                  16.26    4.38      16.57    4.49      -
  Intraception                 17.26    4.88      17.34    4.78      -
  Succorance                   11.38    5.25      11.00    4.13      -
  Dominance                    15.58    4.68      14.41    5.93      1.95
  Abasement                    14.31    4.99      14.70    5.67      -
  Nurturance                   15.66    5.18      16.18    4.81      -
  Change                       17.57    4.78      16.90    4.55      -
  Endurance                    13.63    5.67      13.08    5.97      -
  Heterosexuality              14.97    5.08      15.08    5.53      -
  Aggressiveness               11.00    4.62      10.44    4.54      -

a F < 1.00.



TABLE 2
CORRELATION OF THE EPPS SCALES WITH THE CRITERION OF SUCCESS
IN PSYCHOLOGY

Scale              Experimental   Cross-validation   Total sample
Achievement           .113            .191              .153*
Deference             .013           -.109              [...]
Order                 .029           -.214*            -.080
Exhibition            .027            .157              .092
Autonomy              .073           -.040              .013
Affiliation          -.139           -.161             -.149*
Intraception          .022            .099              .059
Succorance           -.205*           .078             -.087
Dominance             .209*           .226*             .213**
Abasement            -.054           -.068             -.060
Nurturance           -.167           -.163             -.164*
Change               -.244**          .085             -.114
Endurance             .088           -.061              [...]
Heterosexuality       .192            .037              [...]
Aggressiveness        .115           -.038              .043

* p < .10.
** p < .05.


That the procedure used to divide all subjects into experimental
and cross-validation groups did result in comparability of the
groups in many important respects is indicated by the data in
Table 1, which presents the mean scores and standard deviations
for the two groups on the reading comprehension test, the
criterion, and the 15 EPPS scales. Not one of the obtained F
ratios is significant at the .05 level.

The correlation coefficients between the criterion of success in
the psychology course and each of the 15 scales of the EPPS are
indicated in Table 2 for the experimental and cross-validation
groups as well as for the total sample. For the experimental
group, the correlation coefficients for Succorance and Dominance
were significant at the .10 level, while that for Change was
significant at the .05 level. The cross-validation group [...]
significance at the .10 level for the Order and Dominance
scales. For the total sample of subjects, the correlation
coefficients for Achievement, Affiliation, and Nurturance were
significant at the .10 level while that for Dominance was
significant at the .05 level.

It would appear that the desired stability of relationship of
the EPPS scales with the criterion is lacking in this study. For
example, of the three scales whose relationship with the
criterion was significant at the .10 level [...] in the
experimental group, only one "held up" for the cross-validation
group, while the algebraic signs of the correlation coefficients
for [...] groups actually were [...]. In order to verify this
conclusion [...] the Wherry-Doolittle [...] was utilized to
ascertain the best selection of scales for the experimental
group and the appropriate beta weights to be used in conjunction
with each scale. Stability of the [...] is indicated by the
extent to which these beta weights apply for use with the
cross-validation group. As indicated by Table 3, for the
experimental group it was found that the use of the Change,
Succorance, and Heterosexuality scales results in the maximal
value of multiple correlation corrected for shrinkage. Addition
of the fourth best scale, Abasement, results in a decrease in
the magnitude of the correlation coefficient.

The beta weights obtained by the Wherry-Doolittle test selection
method are -.252 for Change, -.233 for Succorance, and .181 for
Heterosexuality. Utilizing these weights derived for the
experimental group and the correlation coefficients of the
chosen scales with the criterion in the cross-validation group
[...] composite coefficient of correlation for the
cross-validation group of -.058.
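
The reported figures can be tied together with a small worked check. It rests on two assumptions that the source does not state: that the squared multiple correlation equals the sum of each beta weight times the corresponding scale-criterion correlation, and that the shrinkage correction has the Wherry form 1 - (1 - R^2)(N - 1)/(N - m), with m the number of scales selected. The Change weight is read here as -.252; the validities are the experimental-group values from Table 2.

```python
from math import sqrt

# Beta weights from the text (Change read as -.252, an assumption)
# and experimental-group correlations with the criterion (Table 2).
betas = {"Change": -0.252, "Succorance": -0.233, "Heterosexuality": 0.181}
validities = {"Change": -0.244, "Succorance": -0.205, "Heterosexuality": 0.192}


def multiple_r_squared(betas, validities):
    """R^2 as the sum of beta weight times validity for each scale."""
    return sum(betas[name] * validities[name] for name in betas)


def shrunken_r(r_squared, n, m):
    """Multiple R corrected for shrinkage (assumed Wherry form)."""
    return sqrt(1 - (1 - r_squared) * (n - 1) / (n - m))


r_squared = multiple_r_squared(betas, validities)
r_corrected = shrunken_r(r_squared, n=70, m=3)
print(round(sqrt(r_squared), 3), round(r_corrected, 3))
```

Under these assumptions the corrected value comes out within rounding distance of the multiple R of .345 reported for the experimental group, which is at least consistent with this reading of the weights.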


TABLE 3
BEST SELECTION OF EPPS SCALES FOR THE EXPERIMENTAL GROUP BY THE
WHERRY-DOOLITTLE METHOD

Scales                                                  R
Change                                                  [...]
Change + Succorance                                     .313
Change + Succorance + Heterosexuality                   .345
Change + Succorance + Heterosexuality + Abasement       .331


Thus, the validity evidenced for the 
EPPS scales in the experimental group 
did not at all hold up in the cross-vali- 
dation group, in that the scales and 
weights that resulted in a multiple R
of .345 for the experimental group 
yielded a slightly negative composite 
coefficient for the second group. 


SUMMARY


Previous studies have reported pre- 
dictiveness for certain scales of the 
Edwards Personal Preference Schedule 
(EPPS) in regard to academic over- 
and underachievement when the anal- 
ysis of variance technique was applied 
to selected extreme groups. The pur- 
pose of the present study was to test 
out the EPPS in a situation in which 
no subjects are eliminated from the 
study, the correlation technique is 
used, and both experimental and cross- 
validation groups are included. 

The selection of the best EPPS 
scales through the use of the Wherry- 
Doolittle test selection technique re- 
sulted in a coefficient of multiple cor- 
relation of .345 for the experimental 
group. However, when these scales 
were used for the cross-validation 
group, the value of the correlation co- 
efficient degenerated to —.058. 

The results of the study do not nec-
essarily indicate a lack of usefulness
for the EPPS, but they do highlight
the shortcomings of many of the re-
search studies which have reported
success in prediction with the EPPS
scales.


REFERENCES 


Buros, O. K. The fifth mental measurements yearbook. Highland
Park, New Jersey: Gryphon, 1959.

Edwards, A. L. Manual for the Edwards Personal Preference
Schedule. New York: Psychological Corporation, 1954.

Garrett, H. E. Statistics in psychology and education. New York:
Longmans, Green, 1958.

Gebhart, G. G., & Hoyt, D. P. Personality needs of under- and
overachieving freshmen. J. appl. Psychol., 1958, 42, 125-128.

Krug, R. E. Over- and underachievement and the Edwards Personal
Preference Schedule. J. appl. Psychol., 1959, 43, 133-136.

McNemar, Q. At random: Sense and nonsense. Amer. Psychologist,
1960, 15, 295-300.


(Received November 29, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 4, 199-200


A NOTE ON RELATIONSHIPS BETWEEN STUDENT
AND PARENT MINNESOTA TEACHER
ATTITUDE INVENTORY SCORES


ROBERT E. CAMPBELL AND JOHN E. HORROCKS
Ohio State University


Factors contributing to the formation of attitudes toward
teaching and pupil-teacher relationships are [...]. They are
also difficult to [...] with any degree of comprehensiveness or
accuracy. Riccio (1959), in a survey and study of the [...] of
teacher attitudes, identified a number of related variables. The
variables included personality traits, experiences, values, and,
to some extent, academic aptitude. To the best of the writers'
knowledge, an overlooked variable has been the role of parents
in helping [...] the professional attitudes of those of their
children who become teachers. The present study deals with
possible relationships between professional attitudes of
teachers-in-training and those of their parents. Three questions
were considered: How similar are the attitudes of students and
their parents? Does the student tend to identify attitudinally
more with one parent than with the other? What is the
relationship between the attitudes of a student's parents?


PROCEDURE

The Minnesota Teacher Attitude Inventory (MTAI) (Cook, Leeds, &
Callis, 1952) was administered to Ohio State University College
of Education students and their parents. The Ss were mainly
sophomores enrolled in educational psychology [...] classroom
session [...] course. A month later [...] explanatory letter
were [...]. The letter stated that the [...] interested in
parental attitudes toward education. The parents were requested
to answer their questionnaires separately and to refrain from
showing their answers to each other. Returned answer sheets were
carefully screened for misinterpretation of directions.

RESULTS AND DISCUSSION


Table 1 shows the relationship be- 
tween the MTAI scores of students 
and those of their parents. The data 
suggest three trends: the moderate 
positive (.45) agreement between par- 
ents’ attitudes; the similarity of atti- 
tudes between students and their par- 
ents; and though tenuously, the 
students’ attitudinal identification 
with the parent of the same sex. 
Daughters and mothers presented a
higher relationship (.37) than did 
daughters and fathers (.15).1 Simi- 
larly the son-father correlation (.33) 
was higher than the son-mother corre- 
lation (.19). Although these differences 
were not statistically reliable, trends 
in these directions were not unexpected 
in view of similar results reported in 
studies made in other areas of parent- 
child identification. Father-son and 
mother-daughter identifications were 
noted by Gray and Klaus (1956) in 
their study of values, and by Hender- 
son (1958) in his study of interests. 
Lazowick (1955), using an S-R ap- 
proach, has also reported on the nature 
of identification. 

Table 2 presents the means and standard deviations for the
categories of groups used. Although the variance is homogeneous
for all groups, the means are significantly different. Since the
MTAI may be interpreted on an authoritarianism-permissiveness
continuum (high positive scores represent permissiveness in
teacher-pupil relationships whereas low positive or negative
scores represent authoritarian attitudes) it is possible to rank
daughters as most permissive with sons, mothers, and fathers
following in that order. It would appear that an age as well as
a sex variable may have been operating here.

1 The difference between the correlations was not statistically
significant at the [...] level of confidence.


TABLE 1
PRODUCT-MOMENT CORRELATIONS BETWEEN STUDENTS' AND PARENTS'
MTAI SCORES

Group                  r         N
Students-fathers       .285*     127
Students-mothers       .334*     127
Mothers-fathers        .449*     127
Sons-fathers           .330*     39
Sons-mothers           .189**    39
Daughters-fathers      .149**    88
Daughters-mothers      .369*     88

* Significant at .01 level of confidence.
** Significant at .05 level.


TABLE 2
MEANS AND STANDARD DEVIATIONS OF MTAI SCORES FOR PARENTS
AND STUDENTS

Group           N      M        SD
All students    127    30.4     28.4
Mothers         127    1.0      34.5
Fathers         127    -14.3    29.0
Daughters       88     35.4     28.2
Sons            39     18.9     25.2




SUMMARY AND CONCLUSIONS


Similarities of teaching attitudes of 127 teachers-in-training
and their parents were investigated by means of the Minnesota
Teacher Attitude Inventory. The results suggest the following
conclusions: (a) parents are in relatively moderate agreement,
(b) a positive relationship exists between the attitudes of
students and those of their parents, (c) students tend to
identify attitudinally with the parent of the same sex, (d)
daughters are most permissive in their concept of teacher-pupil
relationships while [...] are less so, and (e) mothers and
fathers tend to be more in agreement in their MTAI responses
than do the children of either sex.


REFERENCES

Cook, W. W., Leeds, C. H., & Callis, R.
Minnesota Teacher Attitude Inventory
manual. New York: Psychological
Corporation, 1952.
Gray, S. W., & Klaus, R. Assessment of
parental identification. Genet. psychol.
Monogr., 1956, 54, 87-109.
Henderson, H. J. The relationship between
interests of fathers and sons and sons'
identification with fathers: The
relationship of the adolescent son's
identification with his father to father-son
interest similarity as measured by the
Strong Vocational Interest Blank.
Unpublished doctoral dissertation,
Columbia University, 1958.
Lazowick, L. M. On the nature of
identification. J. abnorm. soc. Psychol.,
1955, 51, 175-203.
Riccio, A. C. The relationship of selected
variables to attitudes toward teaching.
Unpublished doctoral dissertation, Ohio
State University, 1959.


(Received December 29, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 4, 201-208


TEST ANXIETY AND THE INTELLECTUAL 
PERFORMANCE OF COLLEGE 
STUDENTS¹


IRWIN G. SARASON²
University of Washington 


A currently strong trend in the selection
of college students is the reliance
on [illegible] tests of intelligence
and aptitude. Very likely the problems
associated with an increasing college
age population and the heightened demand
for admission to colleges are
going to accelerate this trend. While
the [illegible] of entrance examinations
and [illegible] measures of intellectual
performance is obvious, a legitimate
question would seem to be whether or
not factors other than ability and
aptitude [illegible] the test performance of
students. If it were the case, for example,
that certain personality characteristics
correlated negatively with
[illegible] performance measures, it
would be important to take this
correlation into account in some way
in [illegible] applicants.

Although the problem of the possible
confounding of personality with
intellectual variables has been stated
specifically in terms of admission to
college, it is clear that this is but one
aspect of the more general problem of
the relationship between intelligence
and [illegible]. This is so because all
students in most school systems undergo
systematic intellectual assessment,
and educational decisions frequently
are made on the basis of this
assessment.


¹ This [illegible] supported by a grant
from the [illegible].

² The writer is indebted to [illegible] and
the staff of the [illegible] and Testing
[illegible] Washington for [illegible].

The present research investigated 
the relationship between intellectual 
and personality measures for male and 
female freshman and sophomore college
students. Thirteen intellectual
measures were available for each student,
of which six were high school
grade point averages in a variety of
fields, and seven were objective tests
included in an entrance battery. Six
personality variables were correlated
with each of these 13 intellectual
measures. Three of these six personality
variables were measures of what
might be called general anxiety, i.e.,
anxiety as expressed or experienced in 
a variety of situations. One other 
measure tapped the student’s feelings 
of hostility in a variety of situations 
and another measure was designed to 
assess the subject’s level of defensive- 
ness. The sixth personality variable 
studied, and the one which it was felt 
would most strongly relate to intellec- 
tual performance, was test anxiety, 
i.e., anxiety experienced specifically 
in testing situations. The prediction 
of significant relationships between 
test anxiety and intellectual perform- 
ance was based on the following for- 
mulation. 

Anxiety responses are aroused by 
personally threatening conditions in 
the environment. These responses in- 
terfere with ongoing task-relevant ac- 
tivity and lead to a lowering of per- 
formance level. Although the exact 
nature of the interfering anxiety re- 
sponses is not known, it seems likely 
that they include both heightened 




physiological activity (e.g., heart rate, 
palmar sweating) and self-deprecating
ruminations (e.g., “I can’t pass
this test,” “I wonder how the other 
students are doing”). While these anx- 
iety responses are assumed to be 
aroused under stressful conditions, 
their evocation would not necessarily 
be expected under neutral conditions 
(Sarason, 1960). 

Within this view of anxiety, anxiety 
scales or questionnaires may be con- 
sidered to measure the likelihood of 
emission of anxiety responses under 
conditions of personal threat. Furthermore,
the closer the content of the anxiety
scale is to the situation in which
the subject’s performance is measured,
the better predictor of reaction to
threat will the anxiety scale be. Since
the test anxiety scale used in this 
study contains items dealing with the 
subject’s reactions to testing situa- 
tions, it was expected that it would 
most meaningfully relate to perform- 
ance. Because of the importance of 
college aptitude tests to students it 
was predicted that test anxiety would 
correlate negatively with the seven 
entrance examination tests. The cor- 
relations of test anxiety with the six 
high school grade point averages were 
expected to be of a lower order than 
those with the entrance test battery. 
This latter prediction was made be- 
cause of the possibility that while 
highly anxious students might not be 
able to prepare for and cope ade- 
quately with entrance and aptitude 
tests, they well might be able to reduce
their anxiety levels over a semester
or school year in course work.


Method


Subjects. The subjects were 326 male and
412 female students enrolled in introductory
psychology and introductory sociology
courses at the University of Washington.
The Autobiographical Survey containing
the personality measures (Sarason, [year illegible])
was administered during regular class
sessions. The aptitude measures were obtained
prior to the subjects' entering the University
of Washington. Thus the personality
and intellectual measures were not obtained
contemporaneously.

Materials. The form of the Autobiographical
Survey used in this research consists
of six true-false scales: Test Anxiety
(TA), 17 items; General Anxiety (GA), 18
items; Lack of Protection (LP), [illegible] items;
Hostility (H), 22 items; Need for Achievement
(NA), 31 items; Defensiveness (D),
24 items. Descriptions of the scales have
been presented previously (Sarason, [year illegible]).

The 13 intellectual measures employed are
routinely gathered for entering University
of Washington students. The measures were:

1. High school English average
2. High school mathematics average
3. High school foreign language average
4. High school social studies average
5. High school natural science average
6. High school electives average
7. Verbal score on the Guilford-Zimmerman
Aptitude Survey
8. Mechanical Knowledge score on the
Guilford-Zimmerman Aptitude Survey
9. Cooperative English Usage score
10. Cooperative English Spelling score
11. Cooperative Mathematics I score
12. Cooperative Social Studies II score
13. ACE (1948) Q score


Results


The results for the male and female
subjects will be presented first in
terms of the correlations between the
intellectual and personality measures
and then in terms of the intercorrelations
of the Autobiographical Survey
scores.

Table 1 contains the correlations
between the 6 personality and 13
intellectual variables for the 326 male
subjects. The table also contains the
means for the 13 intellectual measures.
Although GA correlated negatively
and significantly with 4 intellectual
measures and NA correlated negatively
and significantly with 5 of these

* Copies of the Autobiographical Survey
may be obtained from the author.




TABLE 1
Correlations of Six Autobiographical
Survey Scores with Thirteen
Intellectual Measures for
326 Males

Measure                            TA   GA   LP    H   NA    D    Mean
High school English average       -09  -04   00  -06  -05   06    2.82
High school mathematics average   -18  -08  -01  -03  -09  [illegible]
Foreign language average          -09   03   09  -03  -07   00    2.57
Social science average            -17  -04  -01  -08  -01   07    2.95
Natural science average           -15  -07   02  -07  -06   07    2.78
[label illegible]                 -14  -07  -04   12   09   07    3.08
[label illegible]                 [correlations illegible]        21.85
Cooperative English Usage         -27  -12  -02  -06  -14   11   88.61
Cooperative English Spelling      -18  -05  -04   00  -11   03   10.17
Cooperative Mathematics I         -13  -10  -03   00   00   11   23.71
Cooperative Social Studies II     -28  -11  -01   06  -13   08   18.45
ACE Q score (1948)                -30  -17  -06  -01  -06   10   46.98

Note. r = .113 at .05 level; r = .148 at .01 level.


measures, it is clear that TA shows
the [illegible] set of correlations.
For 11 of the 13 intellectual measures
there were significant negative
correlations with TA.

The [illegible] set of correlations for
412 females (Table 2) also shows TA
to correlate negatively and significantly
with 11 of the 13 intellectual
measures, although the same 11 variables
were not significant for both
males and females. The only other
personality characteristic found to
relate to intelligence was H, which
correlated positively with four
intellectual measures.

For both the male and female
subjects, comparisons were made of the
correlations obtained between TA and
high school grade point averages and
between TA and entrance test scores.
For the purposes of this comparison
the six TA


grade point averages were compared 
with six TA entrance test scores. Not 
considered were the male and female 
TA-Mechanical Knowledge correla- 
tions. This omission was based on the 
great differences between male and fe- 
male means on this test. The compari- 
sons made indicated stronger negative 
correlations between TA and the six 
entrance tests than between TA and 
the six high school grade point aver- 
ages. This result was significant for 
both males and females (p < .05 and 
p < .02, respectively). 

Intercorrelations between the six 
Autobiographical Survey measures 
and the means for these measures are 
presented in Tables 3 and 4, for males 
and females, respectively. In general 
the patterns of intercorrelations for 
males and females are quite compara- 
ble. For both sexes the D measure is 


TABLE 2
Correlations of Six Autobiographical
Survey Scores with Thirteen
Intellectual Measures for
412 Females

Measure                            TA   GA   LP    H   NA    D     Mean
High school English average       -12   02   01   02   00   02     3.30
High school mathematics average   -12   03   06   01  -01  -07     2.81
Foreign language average          -14   10   05   07  -05  -07     3.08
Social science average            -13   02   03   05  -02  -04     3.24
Natural science average           -19  -01   05   06  -04   00     3.06
Elective average                  -06   00   01  -01   05  -01     3.33
G-Z Verbal                        -22   00   07   13  -07  -03    27.55
G-Z Mechanical Knowledge          -08  -09  -04   10  -05  -03     4.51
Cooperative English Usage         -25   00   06   09  -03  -02   101.78
Cooperative English Spelling      -16   03   00   11  -04  -03    20.47
Cooperative Mathematics I         -21  -04   02   09  -01   00    17.15
Cooperative Social Studies II     -29  -05   04   11   08  [illegible]
ACE Q score (1948)                -18  -04   01   07   06  -01    43.88

Note. r = .098 at .05 level; r = .128 at .01 level.
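
The critical values quoted in the table notes can be roughly checked with the Fisher z approximation. The sketch below is illustrative only: the function name is hypothetical, the approximation is not necessarily the procedure the author used (the printed values look like entries from a standard table of critical r), and the TA column is entered with decimals restored and the G-Z Verbal sign assumed negative.

```python
from math import sqrt, tanh
from statistics import NormalDist


def approx_critical_r(n, alpha):
    """Approximate two-tailed critical |r| for n paired observations.

    Assumption: under a true correlation of zero, Fisher's z is roughly
    normal with standard error 1 / sqrt(n - 3).
    """
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return tanh(z_crit / sqrt(n - 3))


# TA column for the 412 females, decimals restored (assumed transcription).
ta_females = [-.12, -.12, -.14, -.13, -.19, -.06, -.22, -.08,
              -.25, -.16, -.21, -.29, -.18]

crit_05 = approx_critical_r(412, 0.05)
n_significant = sum(1 for r in ta_females if r < 0 and abs(r) >= crit_05)

print("approximate critical r (N = 412, .05):", round(crit_05, 3))
print("negative TA correlations at or beyond it:", n_significant)
```

Under these assumptions the count agrees with the 11 of 13 significant negative correlations reported in the text, and the approximate critical value falls slightly below the tabled .098.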




TABLE 3
Intercorrelations of Six Autobiographical
Survey Scores
for 326 Males

Scores   TA   GA   LP    H   NA    D    Mean
TA            53   35   13   34  -50    4.97
GA                 58   23   34  -64    5.57
LP                      33   37  -56    6.32
H                            20  -40    8.86
NA                               -35   11.57
D                                      12.50

Note. r = .113 at .05 level; r = .148 at .01 level.


TABLE 4
Intercorrelations of Six Autobiographical
Survey Scores
for 412 Females

Scores   TA   GA   LP    H   NA    D    Mean
TA            48   35   08   35  -45    5.64
GA                 54   26   31  -60    6.49
LP                      33   24  -55    7.08
H                            14  -44    6.96
NA                               -31   10.97
D                                      13.42

Note. r = .098 at .05 level; r = .128 at .01 level.


negatively related to each of the other
Autobiographical Survey measures.


Discussion 


The present findings demonstrate 
that for both men and women there 
are significant negative relationships 
between anxiety reported by subjects 
as being experienced in testing situa- 
tions (TA) and a variety of measures 
of intellectual performance. These results
also confirm the findings of a
previous study (Sarason, 1959) which
showed (a) TA to be negatively
related to many of the same intellectual
variables with which the present study 
was concerned and (b) other person- 
ality measures to be unrelated to in- 
tellectual performance. In the present 
research none of the personality scales, 
except TA, related significantly to 




performance in any consistent manner
for both male and female college students.
Thus, support was obtained for
the hypothesis that the more specific
the measure of personality (in this
case, anxiety) is to the situation being
studied, the more [illegible] will
the personality measure relate to
performance (Sarason, 1960). It was
hypothesized also that TA scores would
be more negatively related to aptitude
test measures than to the six high
school grade point averages. The present
study suggests that this well may
be the case. The Need for Achievement
Scale might have been expected to
relate negatively to intellectual
performance since its items appear to
reflect strong need and anxiety to
achieve. There was an indication that
this might be true for male students
but not for female students.

Two problems then seem to be posed
by the present findings of consistent
negative relationships between TA
and 13 intellectual measures and
of greater strength of those negative
relationships involving aptitude tests
as compared with high school grade
point averages. First, what factors
contribute to the negative relationships
obtained? Second, why is test
anxiety less of a detrimental factor
for course grades than for aptitude
[illegible]?


With respect to the first question
it would seem prudent at present to
interpret the present results as simply
indications of poorer intellectual
performance for high than for low test
anxious students. If this is due to
individual differences among subjects
[illegible], then it may be
[illegible] to determine




what psychological aspects of testing
and academic situations arouse anxiety
[illegible] in high TA scorers and
whether or not the conditions of either
the testing situation or of anxious students
can be modified in such a way
as to [illegible] anxious students'
performance. Results of experimental
studies indicate that under neutral or
[illegible] conditions high and low
anxious subjects do not differ in their
performance levels but that under
[illegible] they do differ in the same
direction [illegible] in the present research
(Sarason, 1958a; Sarason &
Palola, 1960). In view of this evidence
it seems possible that the answer to
the second question, that of the
explanation of the stronger correlations with
[illegible] aptitude test scores than of
grade point averages, may be in the
opportunity of the student in class to
[illegible] and to reassure himself.
Over the course of a quarter or a semester
the anxious student can come to
know his instructors and peers, as well
as [illegible] his familiarity with and
[illegible] the material to be learned.
[illegible] are largely absent
in [illegible] aptitude testing. Greater
emphasis in future research on the
interpersonal aspects of learning and
performance situations should make
clearer the implications of these findings.

Theoretically, the present study has
supported the interfering response
interpretation of anxiety discussed earlier.
These results are consistent with
a conception of the TA as an index
of proneness to performance disruption
under personally threatening conditions.
Practically, these findings
[illegible] need to take account
of the test anxiety factor in
interpreting students' intellectual
performance.


One factor of importance in the


present study concerns that of the 
population of subjects used. While TA 
correlated consistently with the in- 
tellectual measures used, in most cases 
the magnitude of the r’s was not great. 
It seems possible that the restriction 
in range of intelligence inherent in a 
college sample may have worked 
against obtaining even more significant
r's than were found in the present
research. Future research with
high school level subjects and a 
greater range in intellectual ability 
should, if this is correct, lead to larger 
r’s than were obtained in this study.* 
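
The restriction-of-range argument can be made concrete with the standard correction for direct range restriction (often called Thorndike's Case 2). The sketch below is not part of the original analysis: the function name is hypothetical, the ratio of standard deviations is an invented illustrative value (the article reports no such figure), and the correction itself assumes linearity and equal error variance across the ability range.

```python
from math import sqrt


def correct_for_range_restriction(r_restricted, sd_ratio):
    """Estimate the correlation in an unrestricted population.

    sd_ratio is SD(unrestricted) / SD(restricted) on the variable that
    was subject to selection (here, an intellectual measure).
    """
    k = sd_ratio
    r = r_restricted
    return (r * k) / sqrt(1 - r ** 2 + (r ** 2) * (k ** 2))


# r = -.30 is the TA correlation with the ACE Q score for males (Table 1).
# The ratio 1.5 is purely hypothetical and chosen only for illustration.
observed_r = -0.30
hypothetical_sd_ratio = 1.5

corrected_r = correct_for_range_restriction(observed_r, hypothetical_sd_ratio)
print(round(corrected_r, 3))
```

With any ratio greater than 1 the corrected coefficient is larger in absolute value than the observed one, which is the direction of change the paragraph anticipates for samples with a wider range of ability.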

The intercorrelations of the Auto- 
biographical Survey scales show es- 
sentially the same patterns for the 
women as for the men. Interestingly 
the present intercorrelations for col- 
lege students are quite similar to the 
intercorrelations previously reported 
for hospitalized psychiatric patients 

(Sarason, 1958b). The only sizable 
differences in the correlations for the 
two subject populations are the r’s in- 
volving the Hostility scale. For the 
patient population this scale corre- 
lated much more strongly with the 
other Autobiographical Survey meas- 
ures than was true in the case of col- 
lege students. 

Summary

The present study correlated scores
on six personality scales of the
Autobiographical Survey with seven
aptitude test scores and six high
school grade point averages. The subjects
were 326 male and 412 female
college students.

The results demonstrated that of 
the six personality scales only the 
Test Anxiety scale (TA) related con- 
sistently to the 13 intellectual meas- 
ures. For both males and females, TA 

* A recently [illegible] by the [illegible]
subjects [illegible] provides support for
this expectation.



correlated negatively and significantly 
with 11 of the 13 intellectual meas- 
ures. There were indications, however, 
that the negative correlations with TA 
were stronger for aptitude test scores 
than for grade point averages. These 
results were interpreted as supporting 
the hypotheses that: (a) the closer 
the content of anxiety scales’ items is 
to the situation in which subjects are 
to perform the more predictive will 
the items be, (b) TA scores reflect the
proneness of subjects to emit interfering
anxiety responses under conditions
of threat.

Comparisons of [illegible]
matrices of the [illegible]
those previously reported for a
neuropsychiatric population.


REFERENCES

Sarason, I. G. Effects on verbal learning of
anxiety, reassurance, and meaningfulness
of material. J. exp. Psychol., 1958,
56, 472-477. (a)
Sarason, I. G. Interrelationships among
individual difference variables, behavior
in psychotherapy, and verbal conditioning.
J. abnorm. soc. Psychol., 1958, 56,
339-344. (b)
Sarason, I. G. Intellectual and personality
correlates of test anxiety. J. abnorm.
soc. Psychol., 1959, 59, 272-275.
Sarason, I. G. Empirical findings and
theoretical problems in the use of anxiety
scales. Psychol. Bull., 1960, 57, 403-415.
Sarason, I. G., & Palola, E. G. The
relationship of test and general anxiety,
difficulty of task, and experimental
instructions to performance. J. exp.
Psychol., 1960, 59, 185-191.


(Received January 23, 1961) 


Journal of Educational Psychology
1961, Vol. 52, No. 4, 207-214


EFFECTS OF MEMORY, EVALUATIVE, AND CREATIVE
READING SETS ON TEST PERFORMANCE

E. PAUL TORRANCE AND JUDSON A. HARMON


University of Minnesota 


For [illegible] years the senior author
(Torrance, 1950, 1951) has been
experimenting with a variety of procedures
which might increase the likelihood
that students in mental hygiene
courses will make use of scientifically
developed principles of mental health
in their [illegible]. Several recent
experiments (Torrance, 1960) have been
motivated by the belief that the failure
of [illegible] to make more effective
use of such principles results from the
fact that they have not been educated
to think creatively or imaginatively
about [illegible] materials. It soon
became apparent in this experimentation
that many, if perhaps most, students did
not have the expectation that research
results [illegible] scientifically developed
principles could be applied imaginatively
in [illegible] many of their personal
[illegible] problems. At
best they were looking for specific
prescriptions. [illegible] shocking realization that
[illegible] in a graduate course
in mental hygiene did not even expect
to be able to use the knowledge they
were acquiring led the senior author
to try to develop ways of establishing
appropriate “sets.” In one study (Torrance,
1960) random halves of two
classes were asked to read research
articles creatively or critically. Those
who read and reported [illegible]
were later [illegible]
developing a new idea [illegible] their own
[illegible] corresponding test of [illegible]
applications. These results [illegible]
the present study, an extension of
[illegible] “sets” to textbook readings.

The effect of appropriate preparatory
sets in facilitating learning has
been well-established (Johnson, 1955, 
p. 77). Usually a person’s activities 
are integrated and he does not react 
to every event that stimulates his 
sense organs. He is selective not only 
of what he responds to but also of 
what is produced and of the mental 
operation used in responding. 

In designing the present experiment, 
the authors have been guided by Guil- 
ford’s (1959) model of human intel- 
lect. Particular attention has been 
paid to his five mental operations: 
cognition, memory, convergent think- 
ing, divergent thinking, and evalua- 
tion. By “cognition,” Guilford means 
the operations involved in recognizing 
facts, discovery, and rediscovery.
Memory involves the retention of 
what has been cognized. The two kinds 
of productive thinking, convergent 
and divergent, include the generation 
of new information from known in- 
formation and remembered informa- 
tion. Convergent thinking leads to a 
conventional solution; divergent 
thinking leads to new and untested 
solutions. Evaluation refers to decisions
as to the goodness, suitability,
or adequacy of what we know,
remember, or produce through convergent
or divergent thinking. In this ex-
periment a deliberate attempt was 
made to establish sets which would re- 
sult in utilization of these operations, 


METHOD 


Subjects 

The subjects were 115 students in a
graduate level course in Personality
Development and Mental Hygiene at the
University of Minnesota during the summer of
1960. Seventy-six were males and 39 were
females. Most subjects had had professional
experience in one or more phases of education;
5 were public health nurses and 10 had
had no professional experience. The average
number of years of experience was 6.6 years
and the range was from 0 to 30 years.

Subjects were assigned alphabetically to
one or another of three groups (1, 2, or 3).
The group number signified the reading set
to be followed during a period of one week.
This method of assignment to groups resulted
in no biases due to sex, professional
field, or number of years of experience. Some
members of the original sample canceled
the course or otherwise failed to participate
in certain phases of the study, making the
size of the three groups slightly unequal.


Description of the Reading Sets 


The instructions of the memory, evaluation,
and creative application sets were
given to the subjects as follows:

Set A: Memory. [illegible]

[illegible] grammar, punctuation,
and the like. It may take
... (etc., same as above).

Set C: Creative Application. When you
read, it is important that you think about
the many uses of the information which
you are reading. It is especially important
that you think of the various ways in
which the information could be used in
your personal and professional life. In
reading, do not just ask, “What is the
author saying?” Also ask, “How can I use
what the author is saying?” Do not stop
with just one use. Think of as many uses
as you can of the important ideas presented.
Jot down some of these uses for
future reference. It may take some
practice ... (etc.).


Experimental Procedure 


At the beginning of the first experimental
week, subjects received reading instructions
in accord with the group to which
they had been assigned. Each subject
received only one set during any one week.
Thus, during the first week subjects in
Group 1 received Set A; those in Group 2,
Set B; those in Group 3, Set C; and so on,
sets being shifted from group to group
over the 3 weeks.
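
The rotation of sets across groups amounts to a 3 x 3 counterbalanced schedule. The sketch below shows one way such a schedule could be generated; the direction of the shift (each group moving on to the next set in A, B, C order) is an assumption, since the text states only the first-week assignment and that the sets were shifted, and the names used are hypothetical.

```python
# A = memory, B = evaluative, C = creative application
SETS = ["A", "B", "C"]


def assigned_set(group, week):
    """Return the set for a 1-based group number in a 1-based week.

    Assumption: Group g starts on set g in week 1 and advances one set
    per week, wrapping around after Set C.
    """
    return SETS[(group - 1 + week - 1) % len(SETS)]


schedule = {
    group: [assigned_set(group, week) for week in (1, 2, 3)]
    for group in (1, 2, 3)
}

for group, sets_by_week in schedule.items():
    print("Group", group, sets_by_week)
```

Under this assumed rotation every group works under each set exactly once and every set is in use in each week, which is the property that lets set effects be separated from week and group effects.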
Each week each subject was provided an
Evaluation of Reading Set Form on which
he was asked to report the degree to which
he was able to maintain the assigned set,
the degree of difficulty experienced in using
that set, the number of minutes spent in
completing the reading assignment for the
last day of the week, and semantic reactions
to the experience. Osgood's (1952) semantic
differential technique was used in obtaining
the latter. The following polar-adjective
pairs were used with seven-point scales:
good-bad, pleasurable-painful, labored-[illegible],
wise-foolish, active-passive, fast-slow,
cautious-rash, new-old, and interesting-boring.

Each Friday subjects were administered
a 20-minute test which covered the reading
assignment for that day. This test included
items which were defined as: cognitive
(multiple-choice), memory (fill-in-the-blanks),
divergent thinking (extensions from known
facts, uses of facts, questions, hypotheses,
etc.), and evaluation (decision making
situations and justifications of decisions). It was
at this time that the Evaluation of Reading
Set Forms were collected.

Part-scores based on performance in
these subtests were determined for all
subjects and were reported to each subject at
the beginning of [illegible].


The following are samples of items presumed
to involve cognition:

According to a recent study by two New
York scientists involving tests of physical
fitness, which of the following findings was
obtained?
__ European children superior to American
children.
__ [remaining alternatives illegible]

Which of the following is least likely to
[illegible] to the prevention of paranoid
[illegible]?
__ [illegible] in straight thinking.
__ [illegible] attitudes [illegible].
__ Finding available objects to blame.
__ Developing [illegible] attitudes [illegible] sex.
__ [illegible] social interaction.

The following are examples of the memory
items:

According to Wang, the primary factor
in [illegible] maladjustment is either
[illegible] of the child's need for [illegible]
parental [illegible] affection and sympathetic
handling.

[illegible] who used a mechanical
[illegible] in laboratory studies at the
University of Wisconsin is ____.

The following [illegible] the
creative-application [illegible]:

[illegible] hunger needs are not satisfied during
childhood, how is this likely to affect
personality [illegible]? Assume that the
child [illegible] enough food for subsistence
but [illegible] enough to satisfy hunger.
What [illegible] of adult personality would
you expect?

[illegible] has shown that lower socioeconomic
individuals have limited ability
to delay gratification. List as many ways
as you can [illegible] using this information
in educational situations to promote
healthy personality adjustment.

The following is a sample of the evaluative
problems:

[illegible] member of the [illegible]
children's [illegible] at the University
of Michigan and it is estimated that the
cost of treatment per child will be $6,500.
In deciding this issue [illegible] vote?
__ Yes  __ No
How would you justify your vote? Cite as
many sound reasons as you can.


Analysis of Data 

Analyses of variance were computed, 
comparing the effects of the reading sets on
each of the different kinds of test perform- 
ance (cognitive, memory, divergent think- 
ing, and evaluation) for each of the 3 weeks. 
The evaluative-type item, unfortunately, 
was used in only one of the tests. Analyses 
of variance were computed for success in 
maintaining set, difficulty in maintaining 
set, number of minutes spent in completing 
the reading assignment, and each of the nine 
semantic differential scales for the three sets. 
Analyses of variance were also computed 
on Test Scores X Weeks and for Weeks X 
Groups in order to determine relative per- 
formance by weeks and by groups. 


RESULTS 


The data comparing scores on the
memory-type items are presented in 
Table 1. It will be noted that only 
during the third week were the differ- 
ential effects of the three sets statisti- 
cally significant at the .05 level of sig- 
nificance or better, the memory set 
producing the highest score. The pre- 
dicted trend occurred during the sec- 
ond week but was not statistically 
significant. 
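
The relation among the columns of Table 1 can be checked arithmetically. In the sketch below (function and variable names are hypothetical), each printed F ratio is compared with the tabled between-groups entry divided by the within-groups sum of squares over N - k. This reading of the between-groups column as behaving like a mean square is an inference from the numbers, not something the article states.

```python
def f_ratio_from_table(between, within_ss, group_sizes):
    """Divide the tabled between-groups entry by the within mean square.

    The within mean square is the within SS over (N - k), where N is the
    total number of subjects and k the number of reading-set groups.
    """
    df_within = sum(group_sizes) - len(group_sizes)
    within_mean_square = within_ss / df_within
    return between / within_mean_square


# (between entry, within SS, [N memory, N evaluative, N creative])
table_1 = {
    "first": (3.9259, 195.7898, [40, 38, 36]),
    "second": (9.8958, 420.2684, [33, 41, 37]),
    "third": (12.7482, 405.2544, [38, 34, 40]),
}

f_values = {week: f_ratio_from_table(*row) for week, row in table_1.items()}

for week, f in f_values.items():
    print(week, round(f, 3))
```

Under this reading the first and third weeks agree with the printed 2.226 and 3.429 to three decimals; the second week comes out near 2.543 against the printed 2.548, a discrepancy small enough to be a misprint or a scanning error.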

Table 2 presents the means and 
analysis of variance data for scores 
on the Creative-Applications or diver- 
gent-thinking items. It will be noted 
that the differential effects are rather 
strong for each of the 3 weeks and that 
in all cases those working under the 
creative set achieved the highest mean 
score. Those using the evaluative set 
consistently achieved the next highest 
mean score. On the third test all three 
groups tended to achieve higher scores 
than during the earlier weeks. This is 
probably due to the accumulative ef- 
fects of the course which emphasized 
creative thinking concerning the 
course content. 

The data concerning performance 
on the evaluative (decision making) 
items for the second week are shown 
in Table 3. The differential effects are 
statistically significant, with the
evaluative, creative, and memory sets
ranking in that order. Although this
type of item was used at other times
during the course, the only test in the
experiment [illegible] item
[illegible] for the second week.

Table 4 presents the data for the
cognitive items for each of the three
tests. It had been anticipated that
subjects using the memory set would
perform best on these items. This
occurred only during the second week.
The differential effects were also
significant for the [illegible]
working under the evaluative set
[illegible] edge during the first
[illegible] differential effects were
[illegible]

TABLE 1

Comparison of Effects of Three Reading Sets on Performance in
Memory Type Tests (Completion) for Each of Three Weeks

Week     Reading set    N     M     Between SS   Within SS   F ratio
First    Memory         40   6.04
         Evaluative     38   6.24     3.9259     195.7898     2.226
         Creative       36   5.78
Second   Memory         33   7.32
         Evaluative     41   6.65     9.8958     420.2684     2.548
         Creative       37   7.20
Third    Memory         38   8.30
         Evaluative     34   8.15    12.7482     405.2544     3.429*
         Creative       40   7.54

* p < .05.

TABLE 2

Comparison of Effects of Three Reading Sets on Performance on
Creative [illegible] Tests for Each of Three Weeks

Week     Reading set    N     M     Between SS   Within SS   F ratio
First    Memory         40   4.25
         Evaluative     38   5.43    72.4602     643.6542    12.496**
         Creative       36   6.18
Second   Memory         33   4.45
         Evaluative     41   [illegible]  262.6431   379.4977    76.1493
         Creative       37   7.96
Third    Memory         38   6.88
         Evaluative     34   8.37    66.2446     524.4794    13.7613
         Creative       40   8.60

** p < .01.

TABLE 3

Comparison of Effects of Three Reading Sets on Performance in
Evaluative (Decision Making) Type of Test for Second Week

Reading set    N     M     Between SS   Within SS   F ratio
Memory         33   5.88
Evaluative     41   7.34    39.1999     368.4517    11.469**
Creative       37   6.70

** p < .01.


EFFECTS OF READING SETS ON TEST PERFORMANCE 


211 


TABLE 4

Comparison of Effects of Three Reading Sets on Performance on Cognitive Type of Tests (Multiple-Choice) for Each of Three Weeks

Week     Reading set    N     M      Between SS    Within SS    F ratio
First    Memory         40    6.21
         Evaluative     38    6.52     1.9062      236.9290     0.893
         Creative       36    6.33
Second   Memory         33    4.03
         Evaluative     41    3.32    10.3007      200.9297     5.537**
         Creative       37    3.48
Third    Memory         38    4.68
         Evaluative     34    5.06     7.3804      211.8738     3.797*
         Creative       40    4.42
* p < .05.
** p < .01.
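
The F ratios in Table 4 come from one-way analyses of variance on the three reading-set groups. As a hedged illustration (not the authors' stated procedure), the sketch below computes the conventional F ratio from a between-groups and a within-groups sum of squares, using the second-week values printed in Table 4. The function name `f_ratio` and the degrees of freedom (k - 1 and N - k) are assumptions made for illustration. The last line also prints Between SS divided by the within mean square, which is the quantity the printed second-week F appears to match; that is an arithmetic observation, not something the source states.

```python
def f_ratio(ss_between, ss_within, group_sizes):
    """Conventional one-way ANOVA F from sums of squares (illustrative helper)."""
    k = len(group_sizes)              # number of groups (reading sets)
    n_total = sum(group_sizes)        # total number of subjects
    df_between = k - 1
    df_within = n_total - k
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return ms_between / ms_within, df_between, df_within


# Second-week values as printed in Table 4 (memory, evaluative, creative groups).
sizes = [33, 41, 37]
f, df_b, df_w = f_ratio(10.3007, 200.9297, sizes)
ss_over_msw = 10.3007 / (200.9297 / df_w)

print(f"conventional F({df_b}, {df_w}) = {f:.3f}")
print(f"Between SS / within MS = {ss_over_msw:.3f}")
```

The two quantities differ by the between-groups degrees of freedom, so any comparison against tabled critical values should be clear about which one is being used.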


Test Performance by Group and by Week

Analyses of variance were also performed to determine differential test performance by group and by week. It was found that the effects for groups are not statistically significant. The effects for weeks are statistically significant, higher total mean scores being achieved in the third week than for the first two weeks. Thus, there is no evidence of the superiority of any one sequence of sets over the others.


Other Reactions to Assigned Sets

Failure to maintain an assigned set is obviously an important consideration in determining the effectiveness of a set in producing the desired results. Two questions on the Evaluation of Reading Set Form were designed to supply information concerning this problem. Responses concerning the completeness with which the set was maintained ranged from “Not at all” (Point 1) to “almost all of the time” (Point 7). Responses concerning difficulty in learning to use the set ranged from “impossible” ... to “extremely easy” ... Weights were ...


Means and F ratios are presented in Table 5 for the variables: success in using set, difficulty in using set, number of minutes spent in reading assignment for day, and for each of the semantic scales.

From Table 5 it will be noted that the sets had significant differential effects on both success and difficulty in using set but not on time spent on reading assignment. The memory set seems to have been the most successfully and easily maintained and the creative set seems to have been the least successfully and easily maintained. Subjects estimated that they maintained the memory set on the average about two-thirds of the time, finding it between “fairly difficult” and “fairly easy” to maintain. They reported that they were successful in using the creative set less than half of the time.
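
One way to read these verbal estimates against the 7-point ratings is to rescale a mean rating linearly onto a 0 to 1 range. The sketch below does this for the three "success in using set" means printed in Table 5. The linear mapping and the helper name `scale_to_proportion` are assumptions for illustration; the source does not say how the "two-thirds" and "less than half" figures were derived.

```python
def scale_to_proportion(mean_rating, low=1, high=7):
    """Map a rating on a low..high scale linearly onto 0..1 (an assumed reading)."""
    return (mean_rating - low) / (high - low)


# Mean "success in using set" ratings as printed in Table 5.
success_means = {"memory": 5.18, "evaluative": 4.06, "creative": 3.79}

for reading_set, mean in success_means.items():
    print(f"{reading_set:10s} {scale_to_proportion(mean):.3f}")
```

Under this reading the memory mean falls near two-thirds of the scale range and the creative mean just under one-half, which is in line with the description in the text.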

From Table 5 it will also be noted 
that on only four of the nine polar- 
adjective pairs are reactions signifi- 
cantly different among the three con- 
ditions of reading set. Those pairs are: 
fast-slow, cautious-rash, new-old, and 
interesting-boring. Subjects using the 
memory set tended towards fast, cau- 
tious, old, and boring. Those using the 




TABLE 5

Comparison of Effects of Three Reading Sets on Success in Using Set, Difficulty in Using Set, Time Spent on Reading Assignment, and Semantic Reaction for Three Weeks Combined

                                        Memory          Evaluative      Creative
Variable                                N      M        N      M        N      M        F ratio
Success in using set                    106    5.18     109    4.06     105    3.79     ...
Difficulty in keeping assigned set      106    4.54     110    4.10     105    3.97     0.23
Number of minutes spent on reading
  assignment                            104    112.27   110    114.52   105    112.00   0.59
Good-Bad                                105    2.47     109    2.92     102    2.59     0.87
Pleasurable-Painful                     103    3.08     107    3.16     103    3.01     ...
Labored-Easy                            103    3.96     107    3.64     103    3.73     0.30
Wise-Foolish                            103    2.45     109    2.37     102    2.36     ...
Active-Passive                          103    2.64     107    2.67     102    2.48     ...
Fast-Slow                               102    3.96     107    4.38     103    4.33     ...
Cautious-Rash                           102    2.86     107    2.97     ...    ...      ...
New-Old                                 ...    ...      ...    ...      ...    ...      ...
Interesting-Boring                      103    2.76     107    ...      ...    2.13     ...

** p < .01.


evaluative set tended towards slow and held to a middle position ... interesting.

In order to determine whether or not there are differences with the passage of time, ... of analyses of variance showed ... During the second week subjects tended to report greater success in maintaining the sets than during the other weeks. They also tended to spend somewhat more time on the reading assignment during this particular week. Finally, there was a consistent change towards fast and boring over the 3 weeks. Of these four variables the time spent on the reading assignment was significant at .01 < p < .05. The other three were significant at the < .01 level.


Discussion 


From the data which have been presented it seems clear that instructing mature students to assume various reading sets will have differential effects on the kind of goals achieved. This would seem to be especially true if creative thinking and evaluative thinking are desired as outcomes. Although the students in the present study were able to maintain the creative and evaluative sets only about half of the time, attempts to maintain these sets had measurable effects on performance on tests covering the reading assignment.

Since most examinations given in large classes are of the ... (multiple-choice) type, it is worthwhile to note that the memory set does not uniformly result in the best performance on this type of test. ... to result in better mean scores during the other 2 weeks.
Because of the differential effects of the three sets on the four types of instruments, one might also infer that instructors would need to develop tests designed to measure different kinds of achievement, depending upon course objectives. If instructors expect their students to think imaginatively and evaluatively about what they learn, it is very doubtful that such ... adequately assessed by the traditional multiple-choice and completion tests.

The results indicate that students are more successful in maintaining the memory set and find it easier to maintain this set than the creative and evaluative sets. This probably results from the fact that they have had much more experience with this set than with the other two. This hypothesis receives some support from reactions to the semantic differential. For example, subjects using the memory set tend to rate the experience in the direction of cautious, old, and boring. It is likely that with practice, students will find it easier to maintain creative and evaluative sets and that better results will be achieved than those reported herein.

Finally, in interpreting the results, it must be recognized that the subjects are graduate students and that almost all of them have had rather extensive experience in various types of educational work.


SuMMARY 


This experiment was designed to test the differential effects of memory, evaluative, and creative “sets” in reading assignments in a graduate-level course on Personality Development and Mental Hygiene. The 115 subjects were arranged alphabetically and assigned to one of three groups.
Each group was assigned in turn each 
of the three reading sets to be used in 
the assigned readings for a week. At 
the end of each week, a 20-minute test 
containing cognitive, memory, crea- 
tive thinking, and evaluative items 
was administered. Subjects also esti- 
mated their degree of success in main- 
taining the set, difficulty in maintain- 
ing it, the number of minutes spent 
on the Friday’s reading assignment, 
and their semantic reactions to nine 
polar-adjective pairs. 

Analyses of variance indicated that 
the three sets produced differential ef- 
fects on most of the tests for all 3 
weeks. Those operating under the 
creative set, achieved the highest 
mean each week on the creative appli- 
cations. Those operating under the 
evaluative set achieved the highest 
mean on the evaluative or decision 
making problem when such a problem 
was given (second week only). Those 
operating under the memory set 
achieved the highest mean on the 
memory tests (completion) only dur- 
ing the third week. The differential 
effects on the memory items were not 
statistically significant during the 
other weeks. Differential effects on 
the cognitive items were not consis- 
tent. Those operating under the mem- 
ory set achieved the highest mean 
during the second week and those un- 
der the evaluative set led during the 
third week. 

Subjects reported that they found
it easier to maintain the memory set
and rated themselves as more success- 
ful in maintaining this set than the 
other two. There were no differential 
effects on the estimated length of time 
spent on the reading assignment. On 
the semantic differential, subjects under the memory set tended towards fast, cautious, old, and boring. Those under the evaluative set tended towards slow and those under the creative set, towards rash, new, and interesting.

During the second week subjects generally reported greater success in maintaining sets than during the first and third weeks. On the semantic evaluations, there was a consistent and significant drift towards fast and boring. The three original groups did not differ significantly, giving no evidence of the superiority of any one sequence of sets over the others.




(Received February 8, 1961) 


Journal of Educational Psychology
1961, Vol. 52, No. 4, 215-221


EFFECTS OF VISUAL DISCRIMINATION
TRAINING WITH WORD AND LETTER
STIMULI ON LEARNING TO READ A
WORD LIST IN KINDERGARTEN
CHILDREN


SIEGMAR MUEHL* 
Iowa Child Welfare Research Station 


Ability to discriminate among word forms is ... to associating these stimuli with appropriate word ... In a ... study by the author (Muehl, 1960) using kindergarten subjects, visual discrimination pretraining with the same words ... read in a subsequent word list facilitated learning the word list when compared to discrimination pretraining with different words or pretraining with geometric forms. These results suggested the possibility that specific observing responses were learned to the stimuli in the same-word pretraining and that these responses transferred positively to discriminating among the words in the vocabulary task.

The purpose of the present experiment was to determine, more specifically, the stimuli relevant to visual discrimination among word forms for beginning readers. One hypothesis tested is that the child learns to discriminate among words on the basis of the shape or form of the whole word. Another hypothesis tested is that the child discriminates among words on the basis of attending to some part of the word, such as a particular letter, part of a letter, or letter grouping.

* The author wishes to thank Buford W. Gar..., ... of schools, Iowa City; Marjorie Glatterbauel, Dona... ...yiedt, and John Moore, principals; ... teachers; ... Carol Ripp ... cooperation ...

The assumption that word shape is 
the basis for visual discrimination 
among words, although lacking objec- 
tive confirmation (Davidson, 1931), 
appears to be supported by educators, 
judging by the widespread practice of 
the so-called “whole-word” method of
introducing reading in the first grade.
According to Gray (1956, pp. 83-84),
this method is based on the assumption
that “. . . individual words are
the basic units of thought and recognition,”
or, that, “. . . each word has a
characteristic form by which it can 
be remembered.” Although words are 
the basic meaning units in our lan- 
guage, it does not follow they are also 
the basic units of visual recognition. 
The assumption that the beginning 
reader discriminates among words on 
the basis of letters finds support in 
two different areas of educational research.
Comparing intralist errors in
word recognition tasks for children 
ranging from 4 to 6 years in age, 
Gates and Boeker (1923), Meek 
(1925), and Muehl (1960) found that 
letter similarities were the apparent 
source of confusion among words. Us- 
ing a camera to record eye move- 
ments, Buswell (1922) showed that 
beginning readers made many eye fix- 
ations associated with details within 
a word in the early reading stages. 
The results of psychological research bearing on the related question of part vs. whole discrimination are inconclusive. A review of this research with children, reported by Gibson and Olum (1960), showed that the research results were conflicting and appeared to depend on the stimulus material used. However, none of the studies reviewed had used word stimuli.


METHOD

Pretraining Task

To test these alternative assumptions, three groups of kindergarten children were differentiated by the kinds of visual discrimination or matching ... received prior to learning ... list. The pretraining pa...

Stimuli

... LD) was presented in four orders. Two of these orders presented consecutively constituted a two-trial block. The ... letters appearing in the sets were typed in lower case with a primary-style ty... The word stimuli were approximately ... mm. in length and 5 mm. in ... to include the tallest letters. These ... letter stimuli were mounted in the center of ... white cards. Accompanying each two-trial block of stimulus items in the three sets were two ... multiple-choice response cards. The three ... or three letters appearing on the stimulus cards in each set also appeared on these response cards, in two different ... orders. The apparatus for presenting the stimulus and response cards has been described in detail elsewhere (Muehl, 1960). Briefly, it consisted of a tray for ... the stimulus cards, a hinged window below the tray for exposing the response card, and three buttons located on a panel immediately in front of each of the three words appearing on the exposed response card. If the subject selected the correct response word and pressed the button under it, a door chime would ring. The experimenter controlled the appropriate button-chime connection by means of a selector ... from behind the apparatus.

... subjects received eight ... trials. A presentation consisted of the experimenter's placing a stimulus card on the


Responses 


——— 
All Groups 


ee 
Group RR» 
feu 


matching 


rjd 
Group LDe 


Reading Task
Group         Stimuli    Responses
All Groups    feu        blue
              geu        red
              reu        white
LD            Same as above
LD-O          Same as above, plus orienting instructions

a Relevant shape-relevant letter.
b Irrelevant shape-relevant letter.
c Letter discrimination.

tray for approximately 2 seconds. The subject was told to look carefully at this item. The experimenter then removed the card and the subject was instructed to open the response card window, find the same item on the response card that had appeared on the stimulus card, and then press the button directly under it to find out if the choice was correct. If the subject pressed an incorrect button on any one of the first three presentations, the experimenter showed the stimulus item again and had the subject find the correct response item and ring the bell. On subsequent presentations, if the subject responded incorrectly or if the subject did not respond in approximately 5 seconds, the experimenter closed the response card window and presented the next stimulus card. The experimenter alternated response cards with each two-trial block. No rewards were given in pretraining other than provided by the chime and verbal encouragement from the experimenter.


Reading Task

The reading or transfer task was designed to make both word-shape and letter differences among the Group RR pretraining words relevant stimuli for discriminating among the reading task words. For Group IR pretraining words, the reading task was designed to make letter differences alone relevant to discriminating among the reading task words. This reading task paradigm is shown in Table 1.

Nonsense words were paired with color patches. The typed words and color patches were mounted on ... white plastic cards for paired-associate presentation. The apparatus used for the reading task was a Hunter Card Master. The Card Master is designed to present automatically the above plastic cards before a ... aperture by means of a timer unit and two metal shutters. The aperture opening can be controlled to secure the following exposure intervals: an anticipation interval to expose the stimulus word alone; a joint presentation interval to expose the word-color pair; and a between-item interval when the aperture is completely covered, during which time the ... stimulus-response pair is ... behind the shutters.

Four orders of the three word-color pairs appeared in the reading task. Two of these orders presented consecutively constituted a two-trial block. A blank card appeared at


the end of each two-trial block. The follow- 
ing presentation intervals were used: an- 
ticipation interval—3 seconds; joint pres- 
entation interval—3 seconds; and a 
between-item interval—2 seconds. All sub- 
jects continued in the reading task to a 
learning criterion of 11 of 12 correct antici- 
pations in two successive two-trial blocks, 
or for a maximum of 20 trials. Subjects 
reaching a learning criterion before 20 trials 
were given perfect scores on the remaining 
trials. 
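
The scoring rule just described can be sketched as follows. This is a minimal illustration under stated assumptions: each trial presents the three words once (so a trial score runs from 0 to 3 and a two-trial block from 0 to 6), the criterion is checked at the end of each two-trial block against that block and the one before it, and the function and constant names are hypothetical.

```python
WORDS_PER_TRIAL = 3      # three word-color pairs per trial
TRIALS_PER_BLOCK = 2
MAX_TRIALS = 20
CRITERION = 11           # correct anticipations out of 12 in two successive blocks


def reading_task_score(correct_per_trial):
    """Return (total score over 20 trials, trials actually run).

    correct_per_trial lists the number of correct anticipations (0-3) on each
    trial the subject actually completed, in order.
    """
    trials = correct_per_trial[:MAX_TRIALS]
    total = 0
    previous_block = None
    for start in range(0, len(trials), TRIALS_PER_BLOCK):
        block = trials[start:start + TRIALS_PER_BLOCK]
        block_sum = sum(block)
        total += block_sum
        trials_run = start + len(block)
        full_block = len(block) == TRIALS_PER_BLOCK
        reached = (full_block and previous_block is not None
                   and previous_block + block_sum >= CRITERION)
        if reached:
            # Criterion reached: credit the remaining trials as perfect.
            remaining = MAX_TRIALS - trials_run
            return total + remaining * WORDS_PER_TRIAL, trials_run
        previous_block = block_sum
    return total, len(trials)


# A hypothetical subject who reaches criterion at the end of trial 6.
print(reading_task_score([1, 2, 2, 3, 3, 3]))   # (56, 6)
```

With three words and 20 trials the maximum possible score under these assumptions is 60, which gives a scale for reading the group means reported for the reading task.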
The reading task was administered im- 
mediately following pretraining. At the be- 
ginning of the reading task, the subject was 
shown each color patch with the word con- 
cealed and told the appropriate color name. 
The three color patches were then presented 
a second time and the subject asked to name 
the colors. If the subject could not give the 
correct color names he was eliminated. Fol- 
lowing the color naming, a familiarization 
trial with the three word-color pairs was 
given. The subject was told with each paired 
presentation that the word and color went 
together. After presenting the third pair the 
subject was instructed that the game was to 
guess the color that went with each word 
before the color appeared. The learning 
trials began following these instructions for 
subjects in Groups RR, IR, and for half the 
LD group. Since the LD group would not 
see the relevant letters as parts of words 
during pretraining, subjects in this group, 
presumably, would need to learn where to 
look for these letters in the reading task 
words, as well as learn to discriminate the 
relevant letter from the rest of the word. 
To control for learning where to look, the 
other half of the LD group, designated 
LD-O, was given special orienting instruc- 
tions at the beginning of the reading task. 
At the end of the familiarization trial, the 
experimenter told subjects in the LD-O 
group always to look at the beginning of 
the word and this would help them learn 
the right color. At the same time the ex- 
perimenter pointed to the beginning of the 
word. These instructions were repeated twice 
more, at the end of trial Blocks 1-2 and 3-4. 
To secure attention to the stimulus words 
as they appeared in the aperture during the 
reading task, the experimenter frequently 
pointed to the window with a pencil on 
each presentation. If the subject did not re- 
spond during the anticipation interval, the 
experimenter encouraged the subject to try 
to guess a color after looking carefully at
each word.
If a correct response occurred in the anticipation interval, the subject moved a bead across a counting frame. If an incorrect response or omission occurred, the experimenter named the correct color in the
joint presentation interval. The bead in- 
centive and occasional verbal encourage- 
ment from the experimenter served as re- 
wards in the reading task. 


Subjects 


The subjects were ... kindergarten ... City. The subjects were ... to the three ... Groups RR and IR, 30 to Group LD. Three ... three pretraining groups, ...bility, or mean age. Testing was done in the
TABLE 2

Mean Number of Correct Responses During Visual Discrimination Pretraining

                   Blocks of two trials
Group      1-2      3-4      5-6      7-8      Total
WDa        4.27     4.60     5.03     5.10     19.00
LD         5.47     5.23     5.43     5.57     21.70

a Groups RR and IR combined.

TABLE 3

Analysis of Variance of Correct Responses in Eight Learning Trials for Groups WD and LD in Visual Discrimination Pretraining
(Two-trial blocks)

Source                 df      MS       F        p
Between-Subjects       59
  Groups (B)            1     27.34    11.02    <.005
  Error (b)            58      2.48
Within-Subjects
  Trials (A)            3      3.19     4.25    <.01
  A X B                 3      1.98     2.64    <.10
  Error (w)           174       .75
Total                 239

public schools during November and December.


RESULTS 


Pretraining

Comparing Groups RR and IR, the mean number of correct responses were 19.40 and 18.60, respectively; the standard deviations 4.22 and ..., respectively. A Lindquist (1956) Type I analysis of variance of the pretraining scores showed no reliable performance differences between these groups. The trials effect for the combined groups was significant (p < ...), indicating a learning effect. Since there were no performance differences between Groups RR and IR in pretraining these two word discrimination groups were combined and designated as Group WD. Comparing Groups WD and LD in pretraining, the mean number of correct responses were 19.00 and 21.70, respectively; the standard deviations 3.91 and 1.98, respectively. Hartley's test (Walker & Lev, 1953, pp. 193-194) for homogeneity of variance between the groups was significant (p < .01), leading to the rejection of the null hypothesis. The mean numbers of correct responses for Groups WD and LD during pretraining are presented in Table 2. ... Lindquist (1956) Type I analysis of variance for these data ... differences in group means and the trials effect were significant ... variance). A further ... simple effects showed that ... learning effect was ... accounted for within the WD ...
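
For two groups, a variance-ratio check of homogeneity reduces to dividing the larger sample variance by the smaller. Assuming the homogeneity test cited above is Hartley's F-max test, the sketch below computes that ratio from the two standard deviations reported for the pretraining totals (3.91 and 1.98). The function name `f_max` is hypothetical, and the critical value against which the ratio would be judged is not reproduced here.

```python
def f_max(standard_deviations):
    """Ratio of the largest to the smallest sample variance (Hartley's F-max)."""
    variances = [sd ** 2 for sd in standard_deviations]
    return max(variances) / min(variances)


# Standard deviations reported for the pretraining totals of the two groups.
ratio = f_max([3.91, 1.98])
print(f"F-max = {ratio:.2f}")
```

A ratio this far above 1 is consistent with the text's report that the two groups differed in variability, although the formal decision depends on the tabled critical value for the group sizes involved.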


Reading Task

Comparing Groups RR and IR, the mean number of correct responses were 33.07 and ..., respectively; the standard deviations, 11.58 and 11.75, respectively. A Lindquist (1956) Type I analysis of variance showed Groups RR and IR did not differ significantly in overall performance on the reading task. A further comparison between the groups on the first trial block also showed no reliable differences.

On the basis of these findings, the two word-discrimination pretraining groups were again combined and designated as Group WD in the reading task. The mean number of correct responses for Groups WD, LD, and LD-O are presented in Table 4. The learning curves in the reading task for the three groups are presented in Figure 1. A summary of a Lindquist (1956) Type I analysis of variance for these data showed no significant overall performance difference for the groups on the reading task. Comparing Group LD with Groups WD and LD-O separately showed no reliable performance differences (F < 1). Performance differences on the first trial block also were not significant.

TABLE 4

Mean Number of Correct Responses and Standard Deviations for Groups on the Reading Task

Group     Mean      SD
WDa       33.00     11.67
LD        29.53     12.75
LD-O      32.53     14.59

a Groups RR and IR combined.

[Figure 1. Y axis: mean number of correct responses. X axis: blocks of two trials, 1-2 through 19-20. Curves by type of pretraining: word discrimination (WD), letter discrimination (LD), letter discrimination-orienting instructions (LD-O).]

Fig. 1. The effects of visual discrimination pretraining with relevant letters contained in words (Group WD: Groups RR and IR combined) vs. relevant letters presented alone (Groups LD and LD-O) on correct responses in learning to read a three word list.


Discussion 
If word shape is the relevant visual aspect for discriminating among words for beginning readers, Group RR pretraining should have resulted in transfer of discrimination learning to the reading task and faster learning of the list relative to Group IR pretraining. On the other hand, if the relevant letters alone provide the basis for visual discrimination in pretraining, Groups RR and IR pretraining should have resulted in equal transfer of discrimination learning and no performance difference between the groups on the reading task.

The findings based on comparing 
Groups RR and IR on the reading task 
are consistent with the assumption 
that beginning readers discriminate 
among similar length words of differ- 
ent shape on the basis of specific let- 
ter differences. Further investigation 
is necessary to determine the effects of 
at least two other word-stimulus vari- 
ables on visual discrimination learn- 
ing: word-shape differences defined in 
terms of differences in word length and 
location of the relevant letter within 
the stimulus word, i.e., in other posi- 
tions than at the beginning of the 


word. It is possible that placing the relevant letter in some middle position in the word would increase the difficulty of the letter discrimination. This effect could increase the importance of shape or length cues as bases for discrimination.

The findings based on comparing Groups WD (Groups RR and IR combined), LD, and LD-O on reading task performance require an explanation to account for the appearance of learning differences between the word and letter discrimination groups in pretraining, and the absence of learning differences on the reading task. Learning differences in discrimination pretraining indicated differential difficulty in discriminating letters when these letters were presented singly as compared to being presented in words. Absence of learning differences on the reading task indicated that the addition of irrelevant letters to the previously differentiated single-letter stimuli for Groups LD and LD-O did not increase the difficulty of discriminating these letters.

To account for the relative difficulty of the WD group discrimination learning in pretraining, it is assumed that generalization among the ... words increased as a result of the addition of similar stimulus elements (the same two letters) to the relevant letters in each of the words. Thus to discriminate among the words, WD group subjects had to learn observing responses related either to the location of the relevant letter in the words, discriminating the relevant letter from the total stimulus complex, or a combination of the two types of observing responses.

On the basis of this assumption, however, differences in learning performance should have appeared in the reading task, at least for the LD group. The assumption would be that the LD group would need to acquire the same orienting responses ... relevant letters included in the reading task words. This additional learning would be predicted to interfere with the acquisition of the word names. The findings did not confirm this expectation.

To account for the absence of WD-LD group performance differences on the reading task, two explanations are offered. The simplest explanation is based on the greater difficulty and variability of the reading task. The increase in performance variability could mask any differential transfer of discrimination learning between the pretraining groups. A second explanation requires the assumption that exposure to the relevant letters in the LD group pretraining resulted in an “acquired distinctiveness” of the letter stimuli, not detected by the pretraining response measure, that transferred to the reading task. Transfer of this acquired distinctiveness could facilitate the acquisition of the observing responses necessary to discriminate the relevant letters ... in the reading task words, the same observing responses acquired more slowly by the WD group in pretraining.

The second explanation suggests the hypothesis that although letters included in words are more difficult to discriminate than the same letters presented singly, this relative difficulty in learning to discriminate letters in words can be overcome by providing visual discrimination training with the relevant letters prior to presenting them as parts of words. Further research is necessary to test this hypothesis ... the relevant letters ... in other than the first position in the word.


SUMMARY

The purpose of this study was to determine the stimuli relevant to visual discrimination among word forms for beginning readers.

Two groups of kindergarten children received discrimination pretraining with words. For one group, the pretraining words were constructed so that both shape and letter differences among the words were relevant to discriminating among the words in the reading task. For the second group, only the letter differences among the pretraining words were relevant to discriminating among the reading task words. A third group received discrimination pretraining with the relevant letters alone.

There were no significant performance differences between the two word discrimination groups on either the pretraining or reading task, a finding consistent with the assumption that ... children discriminate similar length ... basis of specific letter ... When the letter discrimination pretraining group was compared with the combined word discrimination pretraining groups, there was a performance difference favoring the letter discrimination group in pretraining, but no differences on the reading task between the groups. These findings suggested the hypothesis that although letters included in words are more difficult to discriminate than the same letters presented singly, this relative difficulty in learning to discriminate letters in words can be overcome by providing visual discrimination training with the relevant letters prior to presenting them as parts of words.


REFERENCES 


Buswell, G. T. Fundamental reading habits: A study of their development. Suppl. educ. Monogr., 1922, No. 21.

Davidson, Helen P. An experimental study of bright, average, and dull children at the four-year mental level. Genet. psychol. Monogr., 1931, 9, 119-289.

Gates, A. I., & Boeker, Eloise. A study of initial stages in reading by preschool children. Teach. Coll. Rec., 1923, 24, 469-488.

Gibson, Eleanor J., & Olum, Vivian. Experimental methods of studying perception in children. In P. H. Mussen (Ed.), Handbook of research methods in child psychology. New York: Wiley, 1960. Pp. 311-373.

Gray, W. S. The teaching of reading and writing: An international survey. (Monographs on Fundamental Education, No. 10) New York: UNESCO, 1956.

Lindquist, E. F. Design and analysis of experiments in psychology and education. Boston: Houghton Mifflin, 1956.

Meek, Lois H. A study of learning and retention in young children. Teach. Coll. Contrib. Educ., 1925, No. 164.

Muehl, S. The effects of visual discrimination pretraining on learning to read a vocabulary list in kindergarten children. J. educ. Psychol., 1960, 51, 217-221.

Walker, Helen M., & Lev, J. Statistical inference. New York: Holt, 1953.


(Received February 9, 1961) 




JOURNAL OF 
EDUCATIONAL PSYCHOLOGY 


Volume 52 


October 1961 


Number 5 


PERSONALITY FACTORS AMONG SCIENCE AND TECHNOLOGY FRESHMEN¹

EUGENE B. NADLER
Case Institute of Technology

AND

GILBERT K. KRULEE
Northwestern University

¹ This research was part of a larger series of studies on curriculum evaluation supported by a grant from the Carnegie Corporation of New York. Additional support was made available through a faculty research fellowship awarded to G. K. Krulee by the Social Science Research Council. The authors gratefully acknowledge the [illegible] and encouragement of the Case Institute of Technology administration. We are also very much in debt to Barbara Doty who assisted in the collection and analysis of the data and to Morrell Heald for his cooperation [remainder illegible].

In spite of numerous criticisms of the notion of personality types, ably summarized by Hilgard (1957, pp. 485-488), the notion has proved useful in psychology and anthropology under the [illegible] of syndrome, life style, basic [illegible] structure, modal personality, etc. (See Inkeles & Levinson, 1954, for example). The study reported here had as its starting point such a [illegible], one which could be used to differentiate among freshman students at a school of science and technology.

The focus was upon two dichotomous personality variables distinguished by Murray (1938, pp. 222-228): Extraception-Intraception and Exocathection-Endocathection. The first refers to the direction of the person's interest, outward or inward, while the second refers to its objects, things or ideas. Taking all combinations of both the direction and the object of interest led Murray to describe the following four types:

1. Extraception-Exocathection: To 
adapt to the world as it stands; to be 
interested in tangible results; to be 
very practical; to amass a fortune. To 
secure a permanent position; to be- 
come a member of clubs and institu- 
tions; to be without illusions; to con- 
serve established values. To work 
effectively with mechanical appliances. 

2. Intraception-Exocathection: To 
live imaginatively; to dramatize the 
self; to express one’s sentiments and 
beliefs in action. To initiate and fur- 
ther progressive social movements. To 
speak against abuses; to propose re- 
forms. To concoct new schemes, busi- 
ness ventures, political innovations; 
to be guided by a vision of the future. 
To seek adventure; to become involved 
in amorous affairs. 

3. Extraception-Endocathection: To 
be interested in ideas and theories 
about substantial events (for example, 
physical sciences). To reflect and write 
about external occurrences and sys- 
tems; history, economics, government, 
education. To collect data and think 
inductively. 

4. Intraception-Endocathection: To devote oneself to artistic or religious representations. To dream, brood, and [illegible]; to become absorbed in the attempt to resolve [illegible] and spiritual [illegible].

[illegible] are the authors'.


The study was concerned with the 
following tasks: 


1. testing the hypothesis that signifi- 
cant numbers of students in the school 
would fall into the two extraceptive 
categories, and very few into the two 
intraceptive categories 

2. placing the predicted finding in a 
broader societal context by determin- 
ing what socioeconomic influences, if 
any, might be reflected in the two 
varieties of extraception 

3. describing further the differences
between the extraceptive students who
were exocathecting on the one hand
and endocathecting on the other in
terms of their career goals and life val-
ues


4. examining the influence of these 
orientations on the students’ evalua- 
tion of their courses and curricula, and 
on their academic performance 


METHOD

Differentiating the types. The following questions were included as part of a larger questionnaire² presented to 432 male freshmen about three-quarters of the way through their first semester:

1. Once in my life I would like to: a. develop a valid scientific theory, b. discover a satisfactory principle of ethical conduct, c. invent a marketable product, d. convince a legislature to enact one of my views into law.

2. I would like a job where: a. I could express my social views through my work, b. I could organize the thinking in my field in a systematic way, c. I could discover my true potentialities, d. I could do work of practical importance.

3. I am filled with awe when I consider the lives of: a. Vincent Van Gogh, Mahatma Gandhi, Albert Schweitzer; b. Andrew Carnegie, Charles Kettering, Winston Churchill; c. [illegible], Woodrow Wilson, Abraham Lincoln; d. Frank Lloyd Wright, John Dewey, Albert Einstein.

4. In my spare time I would like to: a. teach woodcraft to a Boy Scout [illegible], b. participate in a campaign for [illegible] schools, c. discuss and evaluate different political systems, d. learn about psychoanalysis.

5. Everyone is a little of everything, [illegible] by and large, I am mostly: a. practical, b. idealistic, c. thoughtful, d. [illegible].

² A copy of this questionnaire has been deposited with the American Documentation Institute. Order Document No. 6834 from ADI Auxiliary Publication Service, Library of Congress, Washington 25, D. C., remitting in advance $2.00 for microfilm or $3.75 for photocopies. Make checks payable to: Chief, Photoduplication Service, Library of Congress.

For every question, each of the four alternatives corresponds to one of the four personality types. If in answering the five questions a student chose three or more alternatives characteristic of a single type, he was placed in that type. The chance probability of this happening was about .10, calculated by collecting the first three terms of the binomial expansion (¼ + ¾)⁵.
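The classification rule and the chance figure described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and the type labels are hypothetical, the answer key linking each alternative to a type is not reproduced here, and the probability is computed for any one named type with each alternative assumed equally likely (p = 1/4).

```python
from collections import Counter
from math import comb


def classify(type_choices, threshold=3):
    """Return the type chosen at least `threshold` times, else "mixed".

    `type_choices` holds one type label per question (five in the study).
    With five answers and a threshold of three, at most one type can qualify.
    """
    label, count = Counter(type_choices).most_common(1)[0]
    return label if count >= threshold else "mixed"


def chance_probability(n_questions=5, threshold=3, p=0.25):
    """Probability that at least `threshold` of `n_questions` answers fall on
    one given type when each answer does so independently with probability p.
    These are the first three terms of the expansion of (1/4 + 3/4)**5."""
    return sum(
        comb(n_questions, k) * p**k * (1 - p) ** (n_questions - k)
        for k in range(threshold, n_questions + 1)
    )


if __name__ == "__main__":
    # Hypothetical answer pattern: three "practical" choices out of five.
    answers = ["practical", "practical", "theoretical", "practical", "idealistic"]
    print(classify(answers))               # practical
    print(round(chance_probability(), 4))  # 0.1035
```

The three terms sum to 106/1024, which agrees with the figure of about .10 given in the text.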
Questionnaire. The same questionnaire included a large number of items which covered the general areas of social class background, career goals and life values, and evaluation of curricula and courses.

Academic performance. At the end of the first freshman semester, two measures of achievement became available, grade point averages and the number of separations.


RESULTS

Of the 432 students surveyed, 143 or 33% were classed as Extraceptive-Exocathecting, and 83 or 19% were classed as Extraceptive-Endocathecting. Only 15 men fell into the Intraceptive-Exocathecting category (3.4%), and only 9 into the Intraceptive-Endocathecting category (2.1%).

The miscellaneous or mixed group (those with fewer than three responses in their largest category) consisted of 182 individuals, comprising 42% of the sample. Thus, the two Extraceptive groups who were predicted to make up a significant proportion of the sample together accounted for about 52% of the male [illegible], with only insignificant numbers in the Intraceptive categories. The rest of this paper is solely concerned with differences among the Extraceptive men, the Exocathecting and the Endocathecting, who will be referred to as the practical men and the theoretical men, respectively [remainder of sentence illegible].
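The group percentages in this paragraph can be checked with simple arithmetic. The sketch below assumes the Extraceptive-Exocathecting count is 143; that figure is an inference obtained by subtracting the other four groups from 432, not a number that is legible in the scan. The dictionary keys are illustrative labels only.

```python
# Group sizes among the 432 freshmen. The first count (143) is an assumption:
# it is the remainder after the four counts that are legible in the text.
group_counts = {
    "Extraceptive-Exocathecting (practical)": 143,
    "Extraceptive-Endocathecting (theoretical)": 83,
    "Intraceptive-Exocathecting": 15,
    "Intraceptive-Endocathecting": 9,
    "Mixed": 182,
}

total = sum(group_counts.values())

# Percentage of the whole sample falling in each group.
shares = {name: 100 * n / total for name, n in group_counts.items()}

# Combined share of the two Extraceptive groups.
extraceptive_share = 100 * (143 + 83) / total

if __name__ == "__main__":
    for name, pct in shares.items():
        print(f"{name}: {pct:.1f}%")
    print(f"Both Extraceptive groups: {extraceptive_share:.1f}%")
```

Under that assumption the two Extraceptive groups come to 226 men, or about 52% of the sample, which matches the combined figure reported in the text.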


Social Class Background

Responses of the subjects to a question dealing with their fathers' occupations at the time of their birth (1940) and to one dealing with their fathers' occupations at the time of the questionnaire (early 1959) were sorted into the following categories: (a) unskilled, semiskilled, farm; (b) skilled; (c) foremen; (d) white collar, sales, government, military; (e) subprofessional; (f) entrepreneurial; (g) professional. Table 1 shows the percentage distributions of each group of fathers at the two times. The chi square test of independence between the two distributions, practical versus theoretical fathers in 1940, was 15.97, significant at the .02 level. The difference between the two distributions was due to the higher proportion of unskilled, semiskilled, farm workers, and foremen among the fathers of the practical men, and the relatively higher proportion of white collar, sales, government, and military workers among the fathers of the theoretical men.

The distributions of fathers' occupations were still significantly different in 1959, chi square being 84.50, significant beyond the .001 level. However, the major difference here was due to the larger proportion of entrepreneurs among the fathers of the theoretical men.

Comparing each group with itself at the two different times showed a major shift among the practical fathers from unskilled, semiskilled, and farm occupations to white collar, sales, government, and military occupations in 1959; the chi square between the 1940 and 1959 distributions was 18.56, significant beyond the .01 level. The fathers of the theoretical men, on the other hand, showed a major increase in the entrepreneurial category; the chi square of 11.16 was not significant, but was close enough to significance to indicate a slight trend, p being about .07. It is interesting that the occupational distribution of the fathers of the practical men in 1959 is almost identical with that of the fathers of the theoretical men in 1940. There were no significant differences in fathers' incomes between the two groups, and the omission of almost 40% of the responses on questions dealing with parental education invalidated any conclusions that might have been drawn from these data.

TABLE 1
FATHER'S OCCUPATION AT DIFFERENT TIMES

                                             Practical men (%)     Theoretical men (%)
Father's occupation                           1940       1959        1940       1959
Unskilled, semiskilled, farm                   36         18          20         11
Skilled                                        11         13          16         19
Foremen                                    [illegible]     6           9          2
White collar, sales, government, military      18         35          29         30
Subprofessional                            [illegible]     2           7          8
Entrepreneurial                                 8         12          12         26
Professional                                   16         18          17         14

Chi square comparisons:                     Chi square     p
Practical vs. theoretical: 1940                15.97      .02
Practical vs. theoretical: 1959                84.50      .001
Practical: 1940 vs. 1959                       18.56      .01
Theoretical: 1940 vs. 1959                     11.16      .07
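The significance levels attached to the chi square values for father's occupation can be related to the statistic with a short calculation. The sketch below rests on an assumption the article does not state: that each comparison has 6 degrees of freedom (seven occupational categories by two distributions). The function name is hypothetical, and the closed form used holds only for an even number of degrees of freedom.

```python
from math import exp, factorial


def chi_square_upper_tail(x, df):
    """Upper-tail probability P(X >= x) for a chi square variable.

    Uses the closed form for even df:
        exp(-x/2) * sum_{k=0}^{df/2 - 1} (x/2)**k / k!
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form requires a positive, even df")
    half = x / 2.0
    return exp(-half) * sum(half**k / factorial(k) for k in range(df // 2))


if __name__ == "__main__":
    # Chi square values reported in the text; df = 6 is an assumption.
    for value in (15.97, 84.50, 18.56, 11.16):
        print(value, chi_square_upper_tail(value, df=6))
```

With 6 degrees of freedom the values 15.97 and 18.56 give probabilities of roughly .014 and .005, in line with the reported .02 and .01 levels, and 11.16 gives roughly .08, near but not identical to the p of about .07 quoted in the text.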


Career Goals and Life Values

In order to analyze the curriculum choices of the students, the school curricula were divided into science and engineering. The one curriculum which did not fit into either category, management, was safely excluded since only one practical man and one theoretical man chose it. The science curriculum comprised math, physics, chemistry, and engineering science. The engineering category included metallurgical, civil, chemical, electrical, and mechanical engineering. In this way a two by two table was constructed with personality type, practical or theoretical, on one margin, and curriculum choice, engineering or science, on the other. The chi square was 59.24, significant beyond the .001 level. Eighty-four percent of the practical men had chosen engineering and 16% science, as opposed to 67% of the theoretical men who had chosen science and 33% engineering.

TABLE 2
CHARACTERISTICS OF JOB AND LIFE GOALS CHOSEN AS IMPORTANT

Characteristic of life goal                      Practical    Theoretical
Ideal job:
  Management                                        17%            4%
  Equipment design                                  14             4
  Graduate school                                   30            73
Characteristics of ideal job:
  Use of special abilities                          62            81
  High income                                       50            28
  Opportunity for creativity                        51            76
  Freedom from supervision                          30            44
Ideal position at peak of career:
  Science                                           25            41
  Administration                                    50            27
  Teaching                                      [illegible]        9
Personal characteristics thought important:
  To be liked by people                             47            27
  To understand other people                        56            35
  Good grounding in basic theory                    59            73
  Practical knowledge of facts                      78            61
  Ability to convince and persuade                  40            24
  Organizing and administrative ability             47            30
Activities expected to give the most satisfaction:
  Career                                            22            36
  Family                                            66            41
  Community and organizations                        2            10
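The two by two comparison of personality type against curriculum choice can be approximately reproduced from the percentages in the text. The sketch below is a reconstruction under assumptions, not the authors' worksheet: it takes 142 practical and 82 theoretical men (the inferred 143 and the reported 83, less the one man in each group who chose management), rounds the percentages to whole counts, and applies an uncorrected Pearson chi square. The function name is hypothetical.

```python
def pearson_chi_square(table):
    """Pearson chi square statistic and degrees of freedom for an r x c table
    of observed counts, with no continuity correction."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return chi2, df


# Assumed counts, reconstructed from the reported percentages:
#   practical:   84% of 142 -> 119 engineering, 23 science
#   theoretical: 33% of 82  -> 27 engineering, 55 science
curriculum_table = [
    [119, 23],  # practical men: engineering, science
    [27, 55],   # theoretical men: engineering, science
]

if __name__ == "__main__":
    chi2, df = pearson_chi_square(curriculum_table)
    print(round(chi2, 2), df)
```

With these reconstructed counts the statistic comes out close to 59.3 on 1 degree of freedom, near the reported 59.24; the small gap is what rounding of the percentages would be expected to produce.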

The distribution of choices of the ideal job upon graduation for the practical men was significantly different from that of the theoretical men, chi square being 41.25, significant beyond the .001 level. The order of choices for the practical men was graduate school, management, equipment design, applied research, new product development, and bringing up the rear, basic research, production, and teaching. The order of choices for the theoretical men was graduate school first, and all other alternatives far behind.

The requirements of the ideal job also differentiated the two groups. Out of a list of 10 possible requirements for the ideal job which were checked for "high" priority, four items discriminated between the two groups, the probabilities ranging between .05 and .001. Among the practical men it was more important to earn a good deal of money, whereas among the theoretical men it was more important to have the opportunity to use their special abilities and aptitudes, to be creative and original, and to be relatively free of supervision by others. Out of the same list which was rechecked for items having "medium" priority, five items separated the groups, the probabilities ranging between .05 and [illegible]. Among the practical men it was moderately important to have the opportunity to use their special abilities and talents, to be creative and original, and to exercise leadership, whereas it was moderately important for the theoretical men to have a chance to earn a good deal of money and to look forward to a stable, secure future.

Responses to a question about the occupational positions which would be ideal at the peak of their careers were classified into the categories of scientific, administrative, ownership, teaching, and other or don't know. The chi square test of independence between the two distributions was 78.17, significant beyond the .001 level. Among the practical men the order of choices was administrative, scientific, ownership, and teaching. Among the theoretical men this order was scientific, administrative, teaching, and ownership.

The personal characteristics which the two groups perceived as promoting vocational success also showed some differences. Out of 17 items checked as essential for success, six discriminated between the two groups at probability levels ranging from .05 to .01. The practical men felt it important to be liked by people, to understand other people, to be able to convince and persuade other people, to have practical knowledge of the facts in one's field, and to have organizing and administrative ability. The theoretical men more often felt it necessary to have a good grounding in basic theory.

A series of four questions dealing with the importance of planning for the future, getting ahead in life, knowledge of one's academic performance in comparison to others, and choices among being successful, independent, or well liked showed one significant difference between the two groups at the .01 level. The practical men preferred success to independence, and independence to being well liked, whereas the theoretical men preferred independence to success, and success to being well liked.


The activities which were expected 
to give the most satisfaction in life 
showed the practical men overwhelm- 
ingly making family relationships a 
first choice, followed by career ac- 
tivities and leisure time activities. The 
theoretical men were about evenly di- 
vided between family relationships 
and career activities, with some senti- 
ment for activities directed toward 
national and international betterment, 
and church activities. The chi square 
between the two distributions was 
17.74, significant beyond the .01 level. 


Evaluation of Curricula and Courses 


In 5 out of 13 ratings describing the 
nature of the ideal curriculum, the 
two groups presented different con- 
ceptions; four of these differences were 
significant beyond the .001 level and 
one at the .01 level. The practical men 
thought the ideal curriculum should 
emphasize practical aspects of the ma- 
terial, relate directly to the kind of 
work they would be doing, emphasize 
facts and techniques that have direct 
application, leave enough free time for 
other activities, and prepare them for 
taking responsibilities in management 
and administration. The theoretical 
men thought that the ideal curriculum 
should be mostly concerned with 
theory, provide them with a broad 
and general knowledge of the field, 
emphasize the ability to think logi-
cally and abstractly, be difficult
enough to require real effort, and not 
have to prepare them for taking re- 
sponsibilities in management and ad- 
ministration. 

Two out of 13 ratings describing what the students realistically expected of their curricula distinguished the two groups, one being significant at the .001 level and one at the .01 level. The theoretical men expected their chosen curriculum to be mostly concerned with theory, but did not expect it to prepare them for taking on responsibilities in management and administration; the opposite was true of the practical men.

TABLE 3
MEAN RATINGS OF ITEMS RELATING TO THE CURRICULUM

Itemᵃ                                                        Practical men   Theoretical men
The ideal curriculum should:
   1. be mostly concerned with theory (1)ᵇ /
      emphasize practical aspects (8)                             5.72            3.74
   2. provide broad knowledge (1) /
      relate directly to future work (8)                          4.15            3.15
   5. emphasize the ability to think (1) /
      emphasize facts and techniques (8)                          4.42            2.69
   6. require real effort on my part (1) /
      leave me with free time (8)                                 3.78            2.57
  10. prepare me for management (1) /
      not prepare me for management (8)                           2.92            4.16
My realistically expected curriculum would:
   1. be mostly concerned with theory (1) /
      emphasize practical applications (8)                        4.80            3.32
  10. prepare me for management (1) /
      not prepare me for management (8)                           3.70            4.70
In my least valuable course:
   1. practical applications are emphasized (1) /
      the primary emphasis is on theory (8)                       5.10            3.62
   3. students influence what is discussed (1) /
      instructor decides what is discussed (8)                    5.57            6.86
  11. the instructor is impartial (1) /
      the instructor is not impartial (8)                         3.13            2.36

ᵃ All items are paraphrased.
ᵇ Numbers in parentheses are the extreme scale values.


Neither the nominations for the most valuable course, nor the descriptive ratings of the most valuable course distinguished the practical men from the theoretical men.

On the other hand, the nominations for least valuable course were different for the two groups, the chi square of 12.926 having a probability of less than .05. Whereas 44% of the theoretical men nominated graphics as their least valuable course, only 17% of the practical men did. On the other hand, 30% of the practical men chose chemistry as their least valuable course, as opposed to 24% of the theoretical men. Other courses were chosen with equal relative frequency by both groups. In 3 out of 14 ratings describing the least valuable course, the two groups presented different evaluations, all significant beyond the .005 level. The theoretical men described their least valuable course as primarily emphasizing practical aspects of the material, and were more vehement in expressing the idea that the instructor did not allow the students to decide what to discuss in class. However, they also felt he was more impartial in dealing with individual students than did the practical men.


Academic Performance 


The grade point average of the practical men at the end of the first semester was 1.034 on a 3-point grading system which included 2 negative quality points for F. The average of the theoretical men for the same period was 1.878. The difference between these two means was significant beyond the [illegible] level for a two-tailed test, t being [first digit illegible].991.

Since it was possible that the obtained difference in school achievement was due to differences in scholastic aptitude between the two groups, they were compared with respect to their performance on two tests of the College Entrance Examination Board [illegible] all freshmen. There was no significant difference between the groups on the Math Aptitude test. On the Verbal Aptitude test, the theoretical men scored slightly higher than the practical men; t was 1.83, significant at the .05 level for a one-tailed test but not for a two-tailed test. Thus, there is weak support at best for the notion that the higher achievement of the theoretical men was due to higher scholastic aptitude.

There was no significant difference in separations between the two groups at the end of the first semester, the percentages being 22% for the practical men and [illegible]% for the theoretical men. However, on the hypothesis that personality [illegible] or ego syntonicity (Levinson, 1959) might make a difference in the separation rate, the subjects were divided between those who had made compatible choices and those who had not. Practical men who chose engineering curricula, or theoretical men who chose science curricula were considered to have made compatible choices; practical men who chose science, and theoretical men who chose engineering were considered to have made incompatible choices. Of 174 men who had made compatible choices, 30 or 17% were separated at the end of the first freshman semester. Of 50 who had made incompatible choices, 17 or 34% were separated at the end of the first freshman semester. The difference between these proportions was significant beyond the [illegible] level.

DISCUSSION

The major differences between these Extraceptive freshmen may perhaps be best understood in terms of the social psychological concepts of status anxiety and status security. The pattern of responses to the questionnaire, especially the data on father's occupation, suggests that the practical
men were relatively more anxious 
about their status, while the theoret- 
ical men were relatively more status 
secure. The orientations of the prac- 
tical men tended to serve the ends of 
upward status mobility in modern in- 
dustrial society; those of the theoret- 
ical men gave greater personal and 
horizontal mobility. The courses and 
curricula of the school were evaluated 
in terms of whether they met the re- 
quirements of these orientations. 

In the days when the differences be- 
tween science and engineering were 
sharp and clear these orientations 
might have had approximately equal 
viability. Boys who liked to work with 
their hands could go into engineering;
thoughtful boys could go into science. 

The trend in recent years, however, 
has been for the engineering curricula 
to become more like the science cur- 
ricula, to emphasize basic scientific 
principles and broad theoretical skills. 
The first year curriculum facing the 
students in this study had this marked 
theoretical character, and was identi- 
cal for all of them. As such, it was less 
ego syntonic for the practical men 
than for the theoretical men, which 
probably accounts for the lower grade 
point average of the practical men at 
the end of the first semester. 

In the first year, all the practical 
men faced a program that was not ego 
syntonic. However, for those who had 
chosen an engineering curriculum, the 
first year was at least a stepping stone 
to a program that was more compat- 
ible with their needs. No such hope re- 
mained for those practical men or 
those theoretical men who had made 
incompatible curriculum choices, since 
the feedback from upperclassmen and 
faculty members regarding the nature 
of the second, third, and fourth year 
programs could only increase the con- 
flicts arising from their incongruous 
decisions. Thus, at the end of the first
freshman semester, those who had
made incompatible curriculum choices
had double the separation rate of those
who had not.

What practical recommendations 
may be made from these considera- 
tions? 

In general, the specific recommenda- 
tions tend to fall under two main head- 
ings: (a) techniques designed to help 
students make ego syntonic choices at 
the outset, and (b) techniques de- 
signed to make it easier to change 
from an incompatible program to one 
that is compatible. 

The present state of personality
measurement probably does not per-
mit the research tool used here to
classify freshman subjects to be used
also to select prospective freshmen,
even if it were greatly improved. On
the other hand, such an instrument 
would have great value in aiding the 
preliminary counseling of freshmen. 
Although the subjects of this study 
were not required to declare a choice 
of curriculum until the end of the first 
year, and had an opportunity to re- 
vise this choice even afterward, it is 
clear that sometime during the first 
semester most of the students had 
made a fairly binding decision, since 
almost all of them answered the ques- 
tion on curriculum choice, whereas 
other questions went unanswered. A
guidance instrument would prove of 
value in clarifying for the student the 
consequences of given curriculum de- 
cisions, and could aid in reversing 
seemingly binding decisions which 
were incompatible with the student’s 
interest. 

As a further way of improving the initial choices of freshmen, the publicizing departments of technical schools could step up a campaign to present a more realistic picture of what a modern technical program entails, focusing on the emphasis on theoretical skills. The present study indicates that, by and large, there is a [illegible] of self-selection in the men who come to a technical school, and that, by and large, this self-selection is highly realistic. An [illegible] publicizing campaign could improve an already effective self-selection system. It also might have the effect of [illegible] students who might otherwise invest their energies elsewhere, influencing another side of the selection process.

There remains the problem of changing incompatible choices once they have been made. Our impression is that the student's choices have been accepted by members of his family and his friends to a high degree, so that part of his identity has come to rest on the expectations that he has induced in others. It would appear to be desirable for the school to counter this social support, to create a climate in which indecision, hesitancy, and ambivalence about curriculum is tolerated, in which shopping around is encouraged, and in which curriculum choices may be more easily postponed or changed.

A second recommendation which attacks this same problem is one which is already being carried out in some respects. With the breakdown of the traditional boundaries between science and engineering in the working world, there is some tendency for a greater number of course choices to be offered the student, thus increasing the possibility of accommodating both practical and theoretical interest patterns within the same department.


SUMMARY

Measuring Intraception-Extraception and Exocathection-Endocathection in the freshman class at a school of science and engineering showed almost no Intraceptive persons present. The Exocathecting men tended to choose engineering as a major field of study, were oriented toward the application of practical skills in managerial and administrative occupations as means of achieving [illegible] and success, and perceived family relationships as bringing satisfaction in life. The Endocathecting men tended to choose science as a major field of study, were oriented toward utilizing theoretical skills in graduate school and scientific occupations as means of achieving self-actualization, and perceived family and career activities as jointly bringing satisfaction in life. In general, the Exocathecting men seemed more anxious about their social status than the Endocathecting men. Compatibility between personal needs and objective [illegible] of the school situation was seen as a major factor in the higher grade point average of the Endocathecting men, and in the lower separation rate of those students who had made ego syntonic curriculum choices. Some practical recommendations stemming from these findings were briefly discussed.


REFERENCES 


Hilgard, E. R. Introduction to psychology. (2nd ed.) New York: Harcourt, Brace, 1957.

Inkeles, A., & Levinson, D. J. National character: The study of modal personality and sociocultural systems. In G. Lindzey (Ed.), Handbook of social psychology. Cambridge, Mass.: Addison-Wesley, 1954.

Levinson, D. J. Role, personality, and social structure in the organizational setting. J. abnorm. soc. Psychol., 1959, 58, 171-180.

Murray, H. A. Explorations in personality. New York: Oxford Univer. Press, 1938.


(Received September 13, 1960) 


Journal of Educational Psychology
1961, Vol. 52, No. 5, 232-235


THE DRAW-A-MAN TEST IN INDONESIA¹


R. MURRAY THOMAS
University of California, Santa Barbara

AND

ANWAR SJAH
Padjadjaran University, Bandung

¹ This study was carried out under a teacher education project of the State University of New York financed by the Ford Foundation. The aid of Winarno Surachmad and research assistants Soeparman, Mariani Azis, Suharsini, and S. Hutabarat is gratefully acknowledged.

Indonesia, a nation of perhaps 90 million people, has no standardized intelligence scale. This state of affairs is the result of a variety of factors, including a dearth of trained test constructors, the existence of many different languages in the nation, and the impossibility of establishing accurate birthdates for many subjects who might form a standardization group.

In trying to provide a temporary instrument to use with children until an adequate test is developed, the writers tried the Goodenough Draw-A-Man scale on a sample of 1,245 children in Bandung, a modern city of about one million inhabitants in the mountains of West Java.

The results of this study are reported here as: (a) a comparison between the success of the Indonesian children and the success of the American children in Goodenough's original standardization group to discover whether American standards for the test should be transferred directly for use in Indonesia, (b) correlations between the Indonesian children's Draw-A-Man scores and their teachers' ratings of their general intelligence, and (c) correlations between the Indonesian pupils' Draw-A-Man scores and their marks in several school subjects to discover whether the test correlated with achievement in school subjects.


METHOD 


Subjects 


The 1,245 children were drawn from three sources: 639 children enrolled in [...] Grades 1-6, 524 children from [...], and 82 children from three [...] kindergartens (there are no public kindergartens).

The [...] were selected by random sampling techniques in the Bandung district, and every child in class the days of the test (spring 1960) was included. In the final compilation of data 109 children were eliminated from the sample because their birthdates could not be established, and 28 were eliminated because they drew women or included cars or boats instead of drawing only a man. This reduced an original sample of 1,382 to a usable sample of 1,245.

The nation’s ideal of compulsory universal education has not yet been realized, primarily because of a shortage of school buildings. Thus not all Bandung children attend school, and inspection of the occupations of the fathers of school groups like this sample suggests that it is biased somewhat in favor of the middle and upper socioeconomic classes.
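The sample accounting described above can be checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are not from the original study, and the figures are simply those stated in the text (an original sample of 1,382, exclusions of 109 and 28, and three sources of 639, 524, and 82 children).

```python
# Figures as stated in the text; the variable names are illustrative only.
original_sample = 1382
no_birthdate = 109        # birthdates could not be established
not_only_a_man = 28       # drew women or added other objects

usable_sample = original_sample - no_birthdate - not_only_a_man

# The three sources that make up the usable sample.
sources = [639, 524, 82]

print("usable sample after exclusions:", usable_sample)
print("sum of the three sources:      ", sum(sources))
```

Both routes should arrive at the same usable sample size, which is a quick consistency check on the reported counts.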


- 


Procedure 


In each classroom the research assistant who administered all of the tests passed [...] the subjects and [...] with pencils, a picture [...] the pupils [...]


THE DRAW-A-MAN TEST IN INDONESIA 


[...] overall intelligence and to place each child in one of seven graduated categories of intelligence. (The Indonesian word ketjerdasan was used to mean intelligence.) The investigators’ purpose was to learn whether there was a relation between scores on the Draw-A-Man Test and intelligence, according to the teacher’s own estimate of the child’s overall ability. The seven intelligence categories were named: very high, quite high, somewhat above average, average, somewhat below average, quite low, and very low. The teachers were instructed to think of these levels as referring to school children; so the lowest category was for the least intelligent children in school and not the least intelligent in the general population.


RESULTS

Table 1 compares the Indonesian group with the normal-school-progress group of Goodenough’s original standardization sample. (Goodenough compared this group with an advanced-progress group and a retarded-progress group, but she based the test’s standards only on the normal-school-progress group.)

Although the Indonesian data were separated for analysis into private and public school samples, the means and standard deviations for the private and public schools were found to be almost identical at each age level. Since it appeared that the two groups represented the same population, they were combined for comparison with the American sample.

As shown by Table 1, the Indonesian group’s mean was above the Goodenough-normal-progress mean at ages 5 (difference significant at .01 level) and 6 (significant at .05 level). But beyond age 6 the Indonesian means lagged behind the Goodenough-normal-progress means at all ages from 7 through 12 (all significant at .01 level) with the exception of age 11 where the Indonesian group spurted ahead of the Goodenough group a slight but statistically insignificant amount. (At no age did the Indonesian average rise as high as the Goodenough-advanced-school-progress group or fall as low as the Goodenough-retarded-school-progress group.)

Indonesian scores on the Draw-A- 
Man Test at each age level showed 
significantly more variation than did 
the Goodenough - normal - progress 
scores. These differences in variances 
were significant at the .01 level for 
ages 6, 7, 8, 10, and 11. They were sig- 
nificant at the .05 level for ages 5, 9, 
and 12 (Table 1). Perhaps this is due 
to the fact that by definition the 
Goodenough-normal-progress group 
did not contain the extremes that 
would be found in an unstratified sam- 
pling of school children. 

In Table 2 correlations (7) between 
Draw-A-Man scores and teachers’ rat- 
ings of intelligence are reported for 
each grade level, kindergarten through 
sixth. The correlations were all posi- 
tive and moderately low or, as in 
Grades 2 and 4, quite low. 
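To illustrate how correlations of this size relate to the significance levels marked in Table 2, the following sketch converts r and N to a t statistic. It is not the authors' computation: the function names are hypothetical, only three grades whose table entries are clearly legible are used, and the critical values are the large-sample normal approximations (an assumption that is reasonable here because every N exceeds 200).

```python
import math

def t_for_r(r, n):
    """t statistic for testing a Pearson r against zero (df = n - 2)."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))

# (grade, N, r) for three grades whose Table 2 entries are clearly legible.
entries = [(1, 210, 0.32), (2, 238, 0.07), (3, 221, 0.27)]

# Two-tailed large-sample critical values; an assumption of this sketch.
CRIT_05, CRIT_01 = 1.96, 2.58

def label(t):
    """Classify a t statistic against the approximate critical values."""
    if abs(t) >= CRIT_01:
        return "p < .01"
    if abs(t) >= CRIT_05:
        return "p < .05"
    return "not significant"

results = {grade: label(t_for_r(r, n)) for grade, n, r in entries}
for grade, verdict in results.items():
    print(f"Grade {grade}: {verdict}")
```

With samples of this size even a correlation near .3 is clearly distinguishable from zero, while one near .07 is not, which is consistent with the description of the Grade 2 value as quite low.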

Table 3 shows correlations (r) be- 
tween pupils’ Draw-A-Man Test 
scores and the most recent end-of-the- 
semester marks received by the pupils 
in various school subjects. (The Indo-
nesian marking system uses numbers
ranging from 1 to 10, with teachers
also commonly adding marks halfway
between each pair of numbers, such
as 5½. The extreme marks of 1, 2, 9,
and 10 almost never appear.)

All correlations were positive. They 
ranged from very low (.04) to rather 
high (.80). In general the highest cor- 
relations were between the Goode- 




R. MURRAY THOMAS AND ANWAR SJAH 


TABLE 1
Comparisons of Indonesian with American Group

                                     Age at last birthday
Group                      5     6     7     8     9    10    11    12    13

Indonesian:
  N cases                 35   104   135   202   204   183   165   121    98
  M                     13.6  15.1  16.8  20.0  23.3  26.9  33.2  32.2     ?
  Median                12.4  14.7  16.3  19.5  22.4  24.9  34.8  32.7     ?
  SD                     5.3   5.5   6.0   6.3   7.9   8.3   8.5   8.6     ?
Goodenough-normal-progress:
  N cases                623   802   669   503   504     ?   101    22
  M                     10.2  13.9  18.5  22.5  26.9  30.6  33.0  35.2
  Median                 9.9  13.9  17.8  22.3  25.8  30.7  33.1  36.1
  SD                     4.1   4.4   4.7   5.4   7.0     ?   6.6   6.2


nough test and arithmetic marks, Next 
highest were those in physical science 
(not taught below Grade 5), geog- 
raphy, and biology. On the average, 


TABLE 2
Correlations of Goodenough Test Scores with Teacher Ratings of Intelligence

                        Grade level
          K      1      2      3      4      5      6
N        82    210    238    221    218    151     97
r      .22*  .32**    .07  .27**    .11      ?  .29**

*p < .05.
**p < .01.


correlations with reading, Indonesian
language (the national language), and
Sundanese (the local dialect) were
somewhat lower.


SUMMARY AND CONCLUSIONS


In Bandung, Indonesia, the school children, kindergarten through Grade 6, who were given the Goodenough Draw-A-Man Test did not, on the average, score as high as did the American group on which Goodenough originally based her standards for the test, suggesting that the American standards for mental ages should not be applied to Indonesians. When Goodenough test scores were


TABLE 3
Correlations of Draw-A-Man Scores and Subject Matter Marks
Gi 
Subject matter Tade leve) 
1 2 
(V = 210) (W = 23 3 
7 WS at we 214) ww Sassy ay © 108) 
Arithmetic ae “5 ae - 
Reading +43 22K 1545+ 55% 56** 30** 
Indonesian 07 31 “Doe8 29+ Soe Be 
Sundanese -26 -06 15+ a. rie apee 
Bulogiual ed 488 “30ee 27% 04 
Biological science a aes vee 
Physical science : “O7#% 55 
44 ee 
*p < .05.
**p < .01.


correlated with school marks in various subject matter areas, the median correlation was about .30. The highest relationships were found between the Draw-A-Man Test and arithmetic marks (from r = .42 to r = .80). Correlations between Draw-A-Man scores and teachers’ judgments of children’s intelligence were low, ranging from .07 to [...] for different grades.

The writers concluded that the Goodenough test yielded sufficiently promising results to warrant its use as a measure of ability for Indonesian children (especially for estimating arithmetic and science abilities in upper elementary grades) until a more adequate test has been developed.


REFERENCE 


GOODENOUGH, FLORENCE L. Measurement of
intelligence by drawings. Yonkers, New
York: World Book, 1926.


(Received December 21, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 5, 236-240 


AMERICAN GRADUATE STUDENTS: 


SOME CHARACTERISTICS OF ASPIRING TEACHERS 
AND RESEARCHERS 


DAVID GOTTLIEB
Department of Sociology and Anthropology, Michigan State University 


In recent years, increasing attention 
has been paid to the phenomena of 
adult socialization as observed within 
American colleges and graduate 
schools (Becker & Strauss, 1956; 
Gottlieb, 1961; Jacob, 1957; Merton, 
Reader, & Kendall, 1957). While the
ratio of speculation to empirical find- 
ings in this field is high, the available 
material would indicate that changes 
do occur in the values, attitudes, and 
behavior patterns of students in train- 
ing and that these changes may at 
least in part be attributed to the social 
system of our educational institutions. 
Several investigators (Becker & Geer, 
1958; Spaeth, 1959) have attempted 
to note how changes in the occupa- 
tional values expressed by graduate 
students may be tied in with a theory
of “anticipatory socialization” (Mer-
ton & Kitt, 1955), a process by which
the individual adopts the values of a


group to which he aspires but does not 
yet belong. 


Purpose 


The research being reported here 
deals with a comparison of two groups 
of American graduate students: (a) 
the student who comes into the grad- 
uate school system with a preference 
for a career in either teaching or aca- 
demic research and reports that there 
has been no change in his preference 
as he has moved through professional
training, (b) the student who states
that since he has been in graduate
school either teaching or research has
become more attractive as a post-
graduate career choice. The major
purpose of this study was to investi-
gate how these students, who find
themselves at different stages of social-
ization and career commitment, vary
in respect to certain values and atti-
tudes.


METHOD

Subjects. The National Opinion Research Center’s sample of graduate students from which the data for this analysis has been taken is a two-stage stratified cluster sample of the enrollment of American graduate schools which offered the PhD in the “traditional arts and sciences” in the autumn of 1958. A total of 2,842 students from 25 graduate schools participated in this survey.

Procedure. In order to obtain some measure of the source and direction of career change each student was asked the following:

Please rank the following in terms of your personal preference as a future occupation.

a. Teaching undergraduates ___
b. Teaching graduates ___
c. Doing research in your field ___
d. Academic administration ___

Have your opinions on these alternatives changed since you first decided to go to graduate school? Please write the letter or letters of alternatives which seem more desirable to you now.


‘ e 

* The research reported here is part of a larger investigation of American graduate [...] by the [...] Opinion Research Center. I am [...] Study Director, and to Joe [...] Director, for their continued cooperation.

The responses to this set of questions were used as the basis for a change index. Persons who indicated that any alternative was more desirable to them are considered “changers.” Persons who indicate no change in preference are considered “nonchangers.” It was found that 41% of the graduate students in this




ASPIRATIONS OF GRADUATE STUDENTS 


TABLE 1
Career Preferences and Change Status (a)

Present preferred    Changing to    Changing to    No change
career               teaching (%)   research (%)      (%)        N    Total %

College teaching          19             19            62       888     100
Research                  10             26            64       936     100
Other (b)                  8             17            75       812     100

a At this point the analysis of changers is limited only to those who report a change toward research or teaching. The remaining group of changers include those who show a preference for both teaching and research, either of these in conjunction with administration, or some other alternatives in addition to those mentioned above.
b The “other” group includes students who wish to enter a professional practice, administration, secondary school teaching, parish work, etc.


sample report a change of some kind in their career preferences.

RESULTS AND DISCUSSION

Table 1 presents the extent to which students in terms of career preference have changed and the direction of that change. From Table 1 it can be seen that about a third of the students who plan a career in either teaching or research indicate there has been a career change of some kind. Among the “others” there is less overall change, amounting to one out of every four subjects in that category. In respect to the direction of change Table 1 shows: (a) those who state a current career preference for teaching are evenly split, 19% indicating a stronger interest in teaching and 19% a stronger interest in research, (b) 10% of the researchers are more favorably disposed toward teaching while 26% state they now find research more attractive as a postgraduate career, (c) in the “other” category change goes in the direction of research at a better than 2 to 1 rate. The data suggest that the graduate school experience has the effect of bringing students closer to a research orientation, and that this pull toward research is felt by students in each of the occupational groups.

Differences in Values and Attitudes

Included in the graduate student questionnaire is a set of “dilemmas,” a group of imaginary situations in which the respondent is asked, after being confronted with two lines of action with alternate implications: “What would you do?” or “What would you prefer?”

One dilemma has to do with the distinction between “localism” vs. “cosmopolitanism” in academic careers and consists of the following question: “In the long run would you rather be known and respected (a) throughout the institutions where you work or (b) among specialists in your field in different institutions?” Subjects who checked a were classified as “locals” while those who checked b were considered “cosmopolitans.”

TABLE 2
Career Preference, Change, and Cosmopolitanism

Percentage cosmopolitan

Present preferred    No change in                Change in
career               career preference     N     career preference     N

Teaching                   4?             552          53             369
Research*                   ?             586          67             587

*χ² significant at <.001 level (research, changers, nonchangers).

Table 2 shows that in each case




TABLE 3
Career Preference, Change, and Work Orientation

Percentage who would devote all or the bulk of their uncommitted time to work

Present preferred    No change in                Change in
career               career preference     N     career preference     N

Teaching*                  69             552          52               ?
Research                   69             554          64             590

*χ² significant at <.001 level (teachers, changers and nonchangers).


those with the research orientation 
show the greatest desire for the na- 
tional recognition. The difference 
between the changers and the non- 
changers is found when the actual per- 
centage differences are compared. The 
difference is one of intensity and not 
direction. Assuming that one’s cosmo- 
politan orientation is tied to his career 
preferences, the results in Table 2 are 
what might be expected. Researchers 
who have not changed are most cosmo- 
politan as this is the type of profes- 
sional view that would go with an un- 
changing attachment to a research 
career. Teachers who have not changed 
are least cosmopolitan since their 
frame of reference and operation 
would most likely be the local aca- 
demic institution. 

Significant differences were also found when respondents in each of the four groups were compared in respect to the postgraduate use of free time. Table 3 shows that while almost two-thirds of the research changers are inclined to devote the major part of their time to their work, only about half of the changers toward teaching are so inclined. A tentative interpretation would be that changers to teaching consist of a number of students who see teaching as a way of life in which the individual is able to regulate his time since he is guided by the number of hours a specific class meets. It may well be that these are the students who hold the more [...] (and perhaps naive) view of the academic world, one in which the professor prepares his lectures, meets his classes, and then retires to the solitude of his study. The nonchanging teachers appear to be no different than the researchers in the use which they would make of their free or uncommitted time.

Career preference and degree of career commitment are also found to be related to how respondents view teaching and research. Table 4 shows that researchers are least likely to agree with the statement, “teaching is intrinsically more satisfying than research,” with those who have changed to teaching, and teachers who have not changed, being most in agreement with the statement. Not unlike the “local-cosmopolitan” situation it would appear that the responses of the changers to teaching and research are partially explained by the fact that these are students in transit. Included are: (a) students who have moved from teaching to research and now question the validity of the statement; (b) students who entered the graduate system with a preference for teaching and who have withstood to some extent the research influences of the graduate school, and in a sense have rededicated themselves to teaching; (c) those students who have made the shift from research to teaching. On the other hand, the nonchangers present a different picture. Here are students who have retained the [...] they brought with them [...] in respect to teaching and research. The nonchanging researchers are strongest in their conviction that, in fact, research is more satisfying than teaching. The teachers who have not changed are most willing to accept the idea that teaching does hold an intrinsic satisfaction over research.

TABLE 4
Career Preference, Change, and Teaching versus Research

Percentage agreeing that teaching is intrinsically more satisfying than research

Present career       No change in                Change in
preference           career preference     N     career preference     N

Teaching                    ?             588          55             368
Research*                   ?             580          26             570


Responses to yet another “dilemma” situation provide further evidence to support the proposition that there are significant differences in attitudes and values when comparisons are made between teaching and research oriented students. In this case respondents were asked the following:

You are a full professor in a small but good department. Two men are being considered for a tenure position, but only one can be appointed.

Smith is very well liked by the students, faculty, [...] throughout the institution. He is an effective teacher and is extremely well read both in his field and in other areas. It is quite clear, however, that he is unlikely to produce research or work of scholarship which will make a fundamental contribution in the field.

Jones is known to everyone as a prima donna. He is difficult to get along with, and students complain about his lack of interest in teaching. It is quite clear, however, that in the long run he is destined to be one of the major intellectual figures in the field.

Which man would you vote for when the department makes its recommendations?

Each group, it would appear, is most likely to give support to his own kind. The teachers, be they nonchangers or changers, are most enthusiastic about Smith. The researchers, particularly those who report no change, are least interested in bringing Smith into the department. While Table 5 does support a general hypothesis of differences among teachers and researchers, it presents data of additional interest. While each group indicates a preference for a colleague who is most like themselves, it will be noted that in each cell the majority express a preference for Smith, the teacher. It is possible only to speculate why this is so. Perhaps individuals prefer Smith since he seems most congenial and easiest to get along with. On the other hand it may well be that researchers and teachers are expressing a need for something they find lacking in their own graduate experience, namely a faculty member who would give more of his time to his students and less to his research interests.

TABLE 5
Career Preference, Change, and Tenure Dilemma

Percentage who would choose Smith

Present career       No change in                Change in
preference           career preference     N     career preference     N

Teaching                   88             552          85             370
Research                   68             593          71             583
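As an illustration of how large the teacher-researcher gap in Table 5 is relative to sampling error, the sketch below computes a two-proportion z statistic for the nonchanging groups. This is not an analysis reported in the article: the function name is hypothetical, the pooled-variance z test is one common choice among several, and the inputs are the "no change" percentages and Ns as read from Table 5.

```python
import math

def two_proportion_z(p1, n1, p2, n2):
    """z statistic for the difference between two independent proportions,
    using the pooled estimate of the common proportion."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    return (p1 - p2) / se

# Table 5, "no change" column: proportion choosing Smith and group size.
teachers = (0.88, 552)
researchers = (0.68, 593)

z = two_proportion_z(*teachers, *researchers)
print(f"z = {z:.2f}")
```

A z value far beyond the conventional cutoffs would agree with the text's reading that the two groups differ, even though a majority in both prefers Smith.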


SUMMARY

The results of a national survey of 2,842 graduate students would indicate that graduate students do alter their career preferences as they progress through their professional training. Significant differences are found in attitudes and values when these students are compared in respect to career preference and degree of career commitment.

The data presented would suggest that a form of “anticipatory socialization” is in operation and has the effect of preparing students for their postgraduate careers.


REFERENCES

Becker, H. S., & Geer, B. The fate of idealism in medical school. Amer. soc. Rev., 1958, 23, 50-56.

Becker, H. S., & Strauss, A. Careers, personality and adult socialization. Amer. J. Soc., 1956, 62, 253-263.

Gottlieb, D. Processes of socialization in American graduate schools. Soc. Forces, 1961, 40, in press.

Jacob, P. E. Changing values in college. New York: Harper, 1957.

Merton, R. K., & Kitt, A. S. Contributions to the theory of reference group behavior. In R. K. Merton & P. F. Lazarsfeld (Eds.), Continuities in social research: Studies in the scope and method of the American soldier. Glencoe, Ill.: Free Press, 1955.

Merton, R. K., Reader, G., & Kendall, P. L. The student physician: Introductory studies in the sociology of medical education. Cambridge: Harvard Univer. Press, 1957.

Spaeth, J. L. Value orientations and career preferences of graduate students. Paper read at the Chicago meetings of the American Sociological Society, September 1959.


(Received February 8, 1961)


Journal of Educational Psychology
1961, Vol. 52, No. 5, 241-248


THE ACQUISITION AND RETENTION OF PAIRED 
ASSOCIATES BY GOOD, AVERAGE, AND 
POOR READERS’ 


WAYNE 


OTTO 


University of Oregon 


1 The study is based upon data collected [...] at The University of [...]. [...] is grateful for help [...] his thesis committee: Theodore [...], Chester W. Harris, and Virgil E. Herrick.

The present study investigates the ability of good, average, and poor readers to learn and to recall a list of paired associates. The particular variables selected for study in this investigation were chosen primarily because of [...] exploratory study [...].

The exploratory study employed a learning task similar to the one used in the present study: a list of five paired associates, each pair consisting of a common geometric form and a low association value trigram. The task discriminated between good and poor readers sufficiently well to suggest [...] examining several specific questions raised by the study:

1. In the earlier study, the subjects’ IQs varied considerably. With intelligence level differences minimized, would the demonstrated differences between good and poor readers’ ability to master paired associates persist?

2. The exploratory study had employed good readers and extremely retarded readers. Would the difference persist if the poor reading group consisted of subjects reading in the lower quarter of their class? How would a group of “average” readers perform?

3. There appeared to be a tendency for visual-auditory reinforcement to result in more rapid learning than did purely auditory reinforcement. With more adequate controls, would a particular mode of reinforcement continue to be more effective than another? Or, with provision made for the study of a possible interaction between mode of reinforcement and reading level, might one mode of reinforcement prove to be better for poor readers while another would be better for good readers?

4. If grade placement of subjects 
were to be varied systematically would 
this grade placement variable interact 
significantly with reading level and/or 
mode of reinforcement? For example, 
might not the most effective mode of 
reinforcement vary from grade level 
to grade level? 

5. What is the effect of these vari- 
ables—reading level, grade placement- 
chronological age, and mode of rein- 
forcement—upon total trials required 
to relearn the list after 24 hours? The 
results of the earlier study suggest that 
there would be no difference in num- 
ber of relearning trials required by 
good and poor readers.

Studies have been reported that 
have been concerned with one or more 
of the above questions. Five general 
statements regarding these studies 
can be made here. 

1. Level of intelligence appears to 
be a significant variable in associative 
learning. There appears to be an in- 
verse relationship between IQ and 
trials required to master a list of 
paired associates (Gates, 1930). 

2. Studies show that associative 




learning ability and reading ability 
are directly related, but there is little 
agreement as to why or to what ex- 
tent this is so (Fildes, 1921; Gates, 
1935; Hincks, 1926; Monroe, 1932; 
Raymond, 1955; Vernon, 1958). 

3. In general, studies have failed to
demonstrate a clear advantage for any
one type of reinforcement. The usual
types of reinforcement investigated
have been auditory, visual, and tac-
tile/kinesthetic, presented singly and
in combination (Berman, 1939; Burt
& Lewis, 1946; Kirk, 1933; Raymond,
1955; Rivkind, 1958; Stauffer, 1948;
Williams, 1952).

4. Studies have not shown adequate 
control of the learning tasks employed 
to permit conclusive interpretations of 
results. The failure of many studies to 
demonstrate advantages of any par- 
ticular mode of reinforcement may 
simply reflect shortcomings of the task 
employed rather than absence of ef- 
fects of the task variables. 

5. There has been no real concern 
with what might be important inter- 
actions. Studies have generally been 
concerned with but one level of read- 
ing ability, with a particular mode of 
reinforcement, with a general range of 
chronological ages or levels of ability 
at the possible cost of what might have 
been significant interactions between 
two or more of these variables, 

The present study, then, attempts to 
control and study what appear to be 
pertinent variables and make allow- 
ance for consideration of possible sig- 
nificant interactions between selected 
variables. The study is an experi- 
mental investigation of relationships 
that, once demonstrated, can serye as 
bases for further research in learning 
to read situations. 

Three hypotheses provide a focus for analysis of the data and subsequent discussion.

1. Good readers, average readers, and poor readers will differ in the number of trials required to master a list of paired associates.

2. Mode of reinforcement will interact significantly with reading level and/or grade level.

3. Good, average, and poor readers will not differ in number of trials required to relearn a list of paired associates 24 hours after acquisition trials.


METHOD

Three major variables were studied: reading level, grade placement-chronological age, and mode of reinforcement.

Subjects

One hundred and eight subjects (four replications of the 3 X 3 X 3 design), chosen from a total population of about [...] elementary school children from [...] schools, served in the experiment. All subjects met the following criteria:

Intelligence. A group test IQ score between 95 and 110. Scores were from the Otis Quick Scoring at Grade 2, and from forms of the California at Grades 4 and 6. The attempt was to hold intellectual capacity within the average range by eliminating high and low deviant scores. The “average” score was subject to corroboration by the classroom teacher.

Reading level. Reading test scores falling in one of three specified categories denoted good, average, or poor. Scores were from forms of the Metropolitan Achievement Tests. Raw test scores were converted to stanine equivalents, which divide the distribution into nine subdivisions of approximately the same range. Thus, operationally defined, good readers were those whose test scores fell in the seventh, eighth, or ninth stanines; average readers had scores falling in the fifth stanine; and poor readers had scores falling in the first, second, or third stanine. The fourth and sixth stanines were dropped out to avoid overlap between groups. The level to which a subject was assigned was subject to corroboration by the classroom teacher.

Grade level. Placement in second, fourth, or sixth grade. Subjects were chosen from three grade levels in order to [...] possible effect of [...] chronological age [...]. The chronological age range was limited to 12 months to eliminate a wide deviation from the usual age at grade level.

[...] children who met the criteria was selected. Four subjects were then assigned to each experimental condition. Since the possible effect of sex differences upon learning efficiency under any of the experimental conditions was not known, two boys and two girls were assigned to each experimental condition. [...] demonstrated no systematic differences [...] of sex, this [...] will not be [...].

Task

List. The paired-associates list employed in the present study evolved from the preliminary study done by the writer. Each pair consists of a common geometric form and a low association value CVC trigram from the Archer (1960) list.

The geometric forms were chosen because preliminary investigation showed that most elementary school children had a dominant name for each one (diamond, circle, triangle, star, square). Geometric form-trigram pairs appeared to be preferable to trigram-trigram pairs to avoid needlessly confusing the younger subjects and subjects with poor reading ability with two word-like stimuli in each pair.

The trigrams chosen had association values of [...] or less according to the Archer list. The five trigrams were chosen to employ the [...] sounds of the common vowels [...] without repeating any [...]. The list follows. (The common name of each form is given here in lieu of the form itself.) Diamond-fep, circle-miv, triangle-[...], star-yad, square-gox. In those reinforcement conditions in which the subject received visual presentation of the trigrams, the trigrams were printed in lower case manuscript style.

Since it was felt that some basis for comparison of associations evoked by the individual forms and trigrams in the different groups of subjects would [...] in interpreting the data, an [...] implemented in the study. [...] results will be alluded to, but [...] further elaboration here.

Apparatus. A Stone Memory Drum was employed in controlling presentation of forms. The original [...] encasing the revolving cylinder provided an opening 5” X 5” for presentation of stimuli. A slide opening was devised to permit the experimenter [...] portions of the total opening. The stimulus sheet mounted on the cylinder was so arranged that five different orders of presentation were possible. The forms presented on the memory drum were [...] high.

Acquisition. All testing was done in a 
room with only the experimenter and the 
subject present and with a minimum of dis- 
tracting outside noise. The writer served as 
experimenter throughout the experiment. 

Each form was presented for 4 seconds— 
at which point the subject was expected to 
say the form name—followed by a 4-second 
presentation of the form-trigram, followed 
by a 4-second rest. There was a 16-second 
intertrial rest. Pairs were first presented in 
serial order until the subject reached the 
criterion of one correct anticipation of form 
names for the entire list. Pairs were then 
presented in scrambled order to the same 
criterion. The total score was serial plus 
scrambled trials to criterion. This sequence 
of presentation was employed because im- 
mediate scrambled presentation proved to 
be disconcerting to the younger subjects. 
The scrambled presentation countered any 
sequential learning effect. 
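The timing of the acquisition procedure can be tallied from the intervals given above. The sketch below is only an illustration: the constant and function names are hypothetical, and it assumes the five-pair list and that the 16-second intertrial rest falls between consecutive trials but not after the last one.

```python
FORM_SECONDS = 4         # form shown alone; subject anticipates the name
PAIR_SECONDS = 4         # form shown together with its trigram
REST_SECONDS = 4         # rest following each pair
INTERTRIAL_SECONDS = 16  # rest between trials
PAIRS_PER_LIST = 5       # five form-trigram pairs in the list

def trial_seconds():
    """Duration of one pass through the whole list."""
    return PAIRS_PER_LIST * (FORM_SECONDS + PAIR_SECONDS + REST_SECONDS)

def session_seconds(n_trials):
    """Elapsed time for n_trials, with an intertrial rest between
    consecutive trials and none after the final trial (an assumption)."""
    if n_trials < 1:
        return 0
    return n_trials * trial_seconds() + (n_trials - 1) * INTERTRIAL_SECONDS

print("one trial:", trial_seconds(), "seconds")
print("ten trials:", session_seconds(10), "seconds")
```

Under these assumptions each pass through the list takes one minute, so the trials-to-criterion scores reported later translate fairly directly into minutes of testing time.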

Recall/relearning. Subjects were asked to 
relearn the list after 24 hours. Pairs were 
presented in scrambled order in the same 
manner as in acquisition trials. The subjects 
continued to the criterion of one correct 
anticipation of form names for the entire 


list. 


Reinforcement 

The term “mode of reinforcement” is 
used in the present paper to refer to the 
manner in which the correct form-trigram 
pairs were presented to the subjects. “Con- 
firmation” might be an equally apt term. 
The actual procedure is described in the 
following paragraphs. The point to be made 
here is that the word “reinforcement” does 
not conform to the strictly conventional us- 
age. 
Three modes of reinforcement were employed. In practice the reinforcement was cumulative: auditory, visual-auditory, and kinesthetic-visual-auditory. For the sake of simplicity, however, the three modes of reinforcement will be denoted auditory, visual, and kinesthetic, respectively.

Auditory. The form was presented for 4 seconds, followed by a second 4-second presentation of the form accompanied by the experimenter’s articulation of the appropriate trigram (form name).

Visual. The form was presented for 4 seconds, followed by a second 4-second presentation of the form accompanied by the experimenter’s articulation of the appropriate trigram plus visual presentation of the trigram. For the visual presentation, the experimenter held a card bearing the printed trigram contiguous to, and to the right of, the form presented by the memory drum. Cards, 1” x 2”, with [...] centered letters, approximately the size of the forms, printed in black were employed.

TABLE 1
Mean Acquisition Trials

        Grade                 Reading level               Reinforcement
    2       4       6      Good   Average   Poor    Auditory   Visual   Kinesthetic
  16.42   11.42   9.14     8.67    13.03    15.28     13.36     12.33      11.28

Kinesthetic. The form was presented for
4 seconds, followed by a second 4-second
presentation of the form accompanied by the
experimenter’s articulation of the appropriate
trigram plus a visual-kinesthetic presentation
of the trigram. For the visual-kinesthetic
presentation, the subject was presented a card
bearing the form-trigram pair and asked to
trace the trigram with his finger. Cards,
10” x 4”, with 1A” x 2” letters, a size which
the subject could readily trace, printed in
black were employed. Letters were lower case
manuscript, of the type most familiar
especially to the second grade subjects.


The subject gave he heart- 
shaped figure and was told that he would be 
asked to learn a new name. The test forms
were then presented and paired with the
appropriate trigrams in the manner of the
reinforcement to be given. Subjects to
receive kinesthetic reinforcement practiced
tracing the sample trigram quickly.


RESULTS AND DISCUSSION


A comparison of mean IQ scores 
shows no significant differences be- 
tween groups compared in subsequent 
analyses. It appears that the random 
assignment of the pooled subjects to 
experimental conditions was reason- 
ably successful in eliminating between 
group IQ differences; therefore, between
group differences do not appear to be
attributable to IQ differences between
groups.

Acquisition. Mean acquisition trials
for the different groups for each of the
three main variables are shown in
Table 1. An analysis of variance
(Edwards, 1950) of the total acquisition
scores is presented in Table 2. There
was little evidence of heterogeneity
of variance (chi square = [illegible] for
26 df, p > .20). Each of the three main
effects is significant at the .01 level.
That differences in performance by
grade level are dependent upon the
reading level of subjects is indicated by
the significant grade x reading level
interaction. That differences due to mode
of reinforcement are dependent upon
grade placement of subjects is indicated
by the grade x reinforcement
interaction. There is no significant
reading level x reinforcement interaction,
nor is there a significant triple
interaction.
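
As a rough check on the analysis of variance just described, each F ratio can be recomputed as the effect mean square divided by the within-cells mean square. The sketch below assumes the mean squares as they can be read from Table 2; the Grade entry in particular is an assumed reading of a damaged figure, and the names `MEAN_SQUARES`, `MS_WITHIN`, and `f_ratio` are illustrative only.

```python
# Mean squares as read from Table 2 (acquisition trials).
# Assumption: the Grade value (498.94) is a reading of a garbled entry.
MS_WITHIN = 6.244  # within-cells (error) mean square, 81 df

MEAN_SQUARES = {
    "Grade (G)": 498.94,
    "Reading Level (L)": 406.735,
    "Reinforcement (R)": 39.065,
    "G x L": 34.347,
    "G x R": 30.975,
    "L x R": 5.16,
    "G x L x R": 11.562,
}


def f_ratio(ms_effect, ms_error=MS_WITHIN):
    """Return the F ratio: effect mean square over error mean square."""
    return ms_effect / ms_error


# One F ratio per source of variation.
F_RATIOS = {source: f_ratio(ms) for source, ms in MEAN_SQUARES.items()}

for source, f in F_RATIOS.items():
    print(f"{source:<18} F = {f:6.2f}")
```

Under these assumed values the recomputed ratios agree with the tabled F values to within rounding, and the reading level x reinforcement ratio falls below 1, which is consistent with no F being reported for that source.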

Table 3 presents results of Tukey
gap tests (Tukey, 1949), which were


TABLE 2
ANALYSIS OF VARIANCE OF THE
ACQUISITION TRIALS

Source               df      MS        F
Grade (G)             2    498.94    79.91**
Reading Level (L)     2    406.735   65.14**
Reinforcement (R)     2     39.065    6.25**
G x L                 4     34.347    5.50**
G x R                 4     30.975    4.96**
L x R                 4      5.16
G x L x R             8     11.562    1.85
Within               81      6.244

** p < .01.




TABLE 3
RESULTS OF TUKEY TEST OF SIGNIFICANT GAPS

         Grade                  Reading level               Reinforcement
Grade  Mean trials  Gap      Level  Mean trials  Gap      Mode  Mean trials  Gap
  2      16.42                 P      15.28                 A      13.36
                    5.00**                       2.25**                      1.03*
  4      11.42                 A      13.03                 V      12.33
                    2.28**                       4.36**                      1.05*
  6       9.14                 G       8.67                 K      11.28***

  * p < .05.
 ** p < .01.
*** The auditory/kinesthetic between means gap is significant (p < .01).


run to determine where significant
differences between group means existed
within the three main effects.
Significant differences in mean acquisition
trials were demonstrated between each
of the three grade levels and between
each of the three reading levels. The
differences between the mode of
reinforcement mean acquisition trials are
approaching the .05 level of significance
for each mode of reinforcement,
but only the difference between the
auditory and visual mode of reinforcement
meets the <.05 requirement.
Further generalization must, of course,
consider the significant interactions
that have been demonstrated.

Several of the questions and hypotheses
can be examined in view of the
acquisition data.
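
The gaps examined in Table 3 are differences between adjacent means once the means are ranked. The following is a minimal sketch of that arithmetic only, assuming the marginal means as they can be read from Tables 1 and 3 (the Grade 6 mean of 9.14 is inferred from the tabled gap); it does not reproduce Tukey's significance criterion, and the function name `adjacent_gaps` is hypothetical.

```python
def adjacent_gaps(means):
    """Rank group means from high to low and return the gap between neighbors.

    `means` maps a group label to its mean number of acquisition trials.
    Each result is (higher group, lower group, gap rounded to 2 decimals).
    """
    ordered = sorted(means.items(), key=lambda item: item[1], reverse=True)
    return [
        (high[0], low[0], round(high[1] - low[1], 2))
        for high, low in zip(ordered, ordered[1:])
    ]


# Marginal means (assumed readings of Tables 1 and 3).
grade = {"2": 16.42, "4": 11.42, "6": 9.14}
reading = {"Poor": 15.28, "Average": 13.03, "Good": 8.67}
reinforcement = {"Auditory": 13.36, "Visual": 12.33, "Kinesthetic": 11.28}

for name, means in [("Grade", grade), ("Reading level", reading),
                    ("Reinforcement", reinforcement)]:
    print(name, adjacent_gaps(means))
```

Whether a given gap is significant still depends on the critical value from Tukey's procedure, which this sketch deliberately leaves out.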
Hypothesis 1: Good, average, and
poor readers will differ in the number
of trials required to learn a list of
paired associates.


TABLE 4
MEAN ACQUISITION TRIALS:
GRADE X READING LEVEL

Good    Average    Poor


The hypothesis is accepted, with 
the restriction that the grade level of 
subjects tested will tend to influence 
the magnitude of differences demon- 
strated. Table 4 makes clear the ex- 
tent of the influence of grade level 
upon differences between reading 
levels. Examination of the mean ac- 
quisition trials presented shows that 
poor readers in Grade 2 took almost 
twice as many trials to master the list 
as did good readers; whereas poor 
readers in Grade 6 took only 1.6 times 
as many trials as did good readers. 
Also, good readers at the three grade 
levels perform more nearly alike than 
do the poor readers. It is suggested 
that the task is not sufficiently difficult 
at the higher grade levels to preserve 
the marked dichotomy displayed at 
Grade 2. 

Hypothesis 2: Mode of reinforce- 
ment will interact significantly with 
reading level and/or grade level. 

The hypothesis is accepted: there 
is a significant grade X reinforcement 
interaction. Mode of reinforcement 
does not, however, interact with read- 
ing level, contrary to what seems to be 
a basic assumption when added cues 
are stressed in much remedial work. 

The nature of the grade x reinforce- 
ment interaction is made clear by 
Table 5. It can be seen that the most 




TABLE 5
MEAN ACQUISITION TRIALS:
GRADE X REINFORCEMENT

                 Reinforcement
Grade   Auditory   Visual   Kinesthetic
  2      16.33     18.42      14.50
  4      13.58      9.83      10.83
  6      10.17      8.75       8.50


efficient mode of reinforcement (re- 
quiring the fewest trials) varies by 
grade level. With the second grade 
subjects, kinesthetic reinforcement ap- 
pears to make for the fewest trials; 
whereas, visual reinforcement appears 
to be most efficient for the fourth grade 
subjects and kinesthetic and visual are 
almost equally effective for the sixth 
grade subjects. The implication seems 
to be that the function of different cues 
supplied by the several modes of rein- 
forcement changes as subjects progress 
in grade level and chronological age. 
This may help to explain why gross 
studies often fail to show a particular 
mode of reinforcement to be better 
than others. In order to secure positive 
results in any predictable manner, one 
would need to control grade level- 
chronological age. 
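
The pattern described for Table 5 can be checked with a short sketch that picks, for each grade, the mode of reinforcement with the fewest mean trials, and also averages each mode over grades. The cell means are those printed in Table 5; the variable names are illustrative, and equal cell sizes are assumed when averaging.

```python
# Cell means from Table 5 (mean acquisition trials, grade x reinforcement).
TABLE_5 = {
    2: {"Auditory": 16.33, "Visual": 18.42, "Kinesthetic": 14.50},
    4: {"Auditory": 13.58, "Visual": 9.83, "Kinesthetic": 10.83},
    6: {"Auditory": 10.17, "Visual": 8.75, "Kinesthetic": 8.50},
}

# Most efficient mode (fewest trials) within each grade.
best_mode = {grade: min(row, key=row.get) for grade, row in TABLE_5.items()}

# Mode means averaged over grades (assumes equal numbers per cell).
modes = ["Auditory", "Visual", "Kinesthetic"]
mode_means = {
    mode: round(sum(row[mode] for row in TABLE_5.values()) / len(TABLE_5), 2)
    for mode in modes
}

print(best_mode)
print(mode_means)
```

The per-grade winners differ even though the averaged mode means fall in a single order, which is the sense in which a pooled comparison can hide the grade x reinforcement interaction.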

The answers to two of the questions 
raised by the earlier study are clear 
from the consideration of Hypothesis 
1 and Hypothesis 2: (a) When intel- 
ligence is held within a stated range, 
will good, average, and poor readers 
differ in the number of trials required 
to learn a list of paired associates?
(b) Will there be significant
interactions between reading level, mode of
reinforcement, and grade level? The
answers to both questions are
affirmative, subject to the restrictions pointed
out in connection with the hypotheses.

The answers to two other questions
are also affirmative: (a) Will the
additional modes of reinforcement
significantly influence rate of learning?
It has been shown that the particular
mode of reinforcement that is most
effective will vary from grade to grade.
(b) Is grade level-chronological age
a significant variable in determining
the number of trials required to learn
a list of paired associates? The
affirmative answer is restricted by the fact
that differences in number of trials
required at different grade levels are
influenced by the reading proficiency
of the subjects and by the mode of
reinforcement employed.

Recall and relearning. “Recall
score” is the number of form names
correctly anticipated 24 hours after
the acquisition trials. “Relearning
score” is the total number of trials
required to enable the subject to
correctly anticipate form names for the
entire list presented in scrambled
order.

Mean recall and relearning scores
for the several groups by variable are
presented in Table 6. It can be seen
that in general the mean recall scores
vary in the same manner as the mean
relearning scores. For this reason and
because relearning efficiency probably

TABLE 6
MEAN RECALL AND RELEARNING SCORES

                   Grade             Reading level         Reinforcement
               2     4     6         G     A     P         V     A     K
Recall        2.86  3.17  3.39      3.56  2.72  3.14      3.47  3.0   3.25
Relearning    2.67  2.11  2.44      2.08  2.50  2.31      2.44  2.92  2.22


more accurately reflects retention,
only the relearning scores were
subjected to analysis of variance. This
analysis is presented in Table 7. Only
grade was found to be significant.
Table 8 shows that the mean relearning
scores for Grades 4 and 6 are
identical. Consequently, it can be
concluded that the only significant
differences in relearning scores are
attributable to the fact that second
grade subjects took more trials to
relearn than did fourth and sixth grade
subjects.

Hypothesis 3 and the related question
can be examined in view of the
relearning data.

Hypothesis 3: Good, average, and
poor readers will not differ in the
number of trials required to relearn a list
of paired associates 24 hours after
acquisition trials. And the question:
What is the effect of the variables
(reading level, grade placement, and
mode of reinforcement) upon total
trials required to relearn the list after
24 hours?

The hypothesis is accepted: there is
no difference in relearning performance
related to reading level of the subjects.
The answer to the question is
that grade placement of the subjects
alone significantly affects relearning
after 24 hours.

That there were no differences between
good, average, and poor readers
in trials required to relearn the list
seems to be a significant finding. This
suggests that given sufficient reinforcement
the future performance of a good,
average, and poor reader would tend
to be similar. Thus while subjects
might require different numbers of
trials to learn a task they might be
expected to attain about the same
performance level. This is part
of the rationale proposed for
the use of “teaching machines” and
“autoinstructional” devices.


TABLE 7
ANALYSIS OF VARIANCE OF
RELEARNING TRIALS

Source               df     MS      F
Grade (G)             2    3.70    5.69**
Level (L)             2    1.57    2.42
Reinforcement (R)     2     .97    1.49
G x L                 4     .46
G x R                 4     .18
L x R                 4     .24
G x L x R             8     .98    1.51
Within               81     .65

** p < .01.


Association test. The association 
test results indicated that the poor 
readers gave more associations to the 
individual stimuli of the paired-associ- 
ates list than did the good readers. As 
pointed out, space limitations preclude 
discussion of the association test; but 
the implication appears to be that 
there is no clear-cut relationship be- 
tween the number of associations 
evoked (fluency) and efficiency in the 
present learning task. 


SUMMARY AND CONCLUSIONS


A 3 x 3 x 3 factorial design was
employed to study the effects of three
levels of reading proficiency, three
levels of grade placement (chronological
age), and three modes of reinforcement
upon acquisition of a list
of paired associates. One hundred and
eight children whose IQ scores were in
the average, 95-110, range provided
data for the investigation.

The results warrant the following 
conclusions: 

1. Good, average, and poor readers— 
in that order—require increasingly 
more trials to master a list of paired 
associates. 

2. Sixth, fourth, and second graders,
in that order, require an increasing
number of trials to master a list of
paired associates; but these differences
are significantly influenced by both
the reading proficiency of the subjects 
involved and the mode of reinforce- 
ment employed. 

3. Mode of reinforcement as a vari- 
able interacts significantly with grade 
level. Therefore, studies purporting 
to investigate the relative efficiency of 
different modes of reinforcement must 
recognize grade level of the subjects 
as an effective variable.

4. There is not a significant inter- 
action between mode of reinforcement 
and reading level. This is contrary to 
the assumption in instructional prac- 
tice that additional cues will be of 
value in a remedial teaching/learning 
situation. 

5. Analysis of total relearning scores
showed grade level to be the only
significant variable. Thus, it appears that
once they have mastered a list of
paired associates, poor readers will
retain the associations as well as good
readers.

6. Analysis of the results of the
association test indicates that the
differences in ability to learn a list of paired
associates demonstrated by good,
average, and poor readers cannot be
explained simply in terms of number of
meanings or associations attached to
the stimuli.

Two limitations seem obvious in
considering the significance of the
present study for the teaching/learning
of reading: (a) The universality
of the present results needs to be
demonstrated. (b) The functional
relationship of the present task to learning
to read is not known. It is felt,
however, that the present results can
be of value in formulating hypotheses
to be evaluated when reading tasks
are employed in the experimental
setup.


REFERENCES


ARCHER, E. J. Re-evaluation of the meaningfulness
  of all possible CVC trigrams.
  Psychol. Monogr., 1960, 74(10, Whole
  No. 497).
BERMAN, A. The influence of the kinesthetic
  factor in the perception of symbols in
  partial reading disability. J. educ. Psychol.,
  1939, 30, 187-198.
BURT, C., & LEWIS, R. B. Teaching backward
  readers. Brit. J. educ. Psychol.,
  1946, 16, 116-132.
EDWARDS, A. L. Experimental design in psychological
  research. New York: Rinehart, 1950.
FILDES, LUCY G. A psychological inquiry
  into the nature of the condition known
  as congenital word blindness. Brain,
  1921, 44, 286-307.
GATES, A. I. Interest and ability in reading.
  New York: Macmillan, 1930.
GATES, A. I. The improvement of reading.
  (Rev. ed.) New York: Macmillan, 1935.
HINCKS, E. M. Disability in reading and its
  relation to personality. Cambridge: Harvard
  Univer. Press, 1926.
KIRK, S. A. The influence of manual tracing
  on the learning of simple words in the
  case of subnormal boys. J. educ. Psychol.,
  1933, 24, 525-535.
MONROE, MARION. Children who cannot read.
  Chicago: Univer. Chicago Press, 1932.
RAYMOND, DOROTHY M. The performance of
  reading achievers on memory span and
  associative learning tests. J. educ. Res.,
  1955, 48, 455-465.
RIVKIND, H. C. The development of a group
  technique in teaching word recognition
  to determine which of four methods is
  most effective with individual children.
  Unpublished doctoral dissertation,
  University of Florida, 1958.
STAUFFER, R. G. Certain psychological manifestations
  of retarded readers. J. educ.
  Res., 1948, 41, 436-452.
TUKEY, J. W. Comparing individual means
  in the analysis of variance. Biometrics,
  1949, 5, 99-114.
VERNON, M. D. Backwardness in reading.
  Cambridge: Cambridge Univer. Press, 1957.
[Author name illegible]. An investigation of
  the relative values of certain word-learning
  techniques for retarded readers.
  Unpublished doctoral dissertation, Temple
  University, 1952.


(Received March 8, 1961) 


Journal of Educational Psychology
1961, Vol. 52, No. 5, 249-253


THE CONTRIBUTION OF RESPONSE 
IN MENTAL MEASUREMENT 


WILLAVENE WOLF
Ohio State University

AND

JAMES B. STROUD
State University of Iowa


Around the turn of the century,
psychologists were active in investigating
the relationship between speed of
response and criteria of intelligence, as
seen, for example, in the work of
Gilbert (1894) and Wissler (1901). Most
of the early workers dealt with simple
motor responses, as in reaction time
studies. At least with simple responses
there appears to be little
support for the existence of a general
speed of response factor, as Tinker
(1931) has pointed out. Generally
negligible correlations have been obtained
between speed of response on
simple tasks and measures of intelligence.
When significant relationships
have been obtained, they have generally
been interpreted as a function
of the character of the specific task
employed. Thus, it appears that rate of
response on a speed test (one on which
rate of response is the main determiner
of the score) is essentially unrelated
to performance on measures of
intelligence. It does not necessarily
follow, however, that rate of response
on a power test is unrelated to criteria
of intelligence. A subject who successfully
completes difficult intellectual
tasks quickly may have more
ability than one who completes them
but does so slowly. [illegible]
it is reasonable to [illegible] for speed,
[illegible] (Kuhlmann, 1939) and Tinker (1931).

In addition to a concern about the
general problem, the writers were
interested in two intelligence tests:
Raven’s Progressive Matrices (revised
order, 1956) and the Leiter International
Performance Scale, both non-verbal
tests. Specifically, we were
interested in investigating the rela- 
tionships between rate of response to 
the individual items on these two tests 
and the total score (i.e., number right 
on the two tests, respectively) , and 
certain independent measures. Some 
recent work with Progressive Matri- 
ces yielded rather low correlations be- 
tween total scores and achievement 
measures, around .40 (Knief & Stroud, 
1959). The form of the test and the 
method of administration make this 
an ideal test for use with the deaf and 
others handicapped in the use of a 
language. Obviously any improvement 
in its predictive power would be to the 
good. Some items from the Leiter were 
selected and used simply to further 
the interest in the general problem— 
the contribution of rate scores on a
power test.


PROCEDURE

The subjects in this investigation
were 102 fifth grade pupils in two
small midwestern communities,
representing the entire fifth grade
enrollment in the two communities. To these
pupils were administered the Progres- 
sive Matrices, one-half of the tests in 
the Leiter scale, the Iowa Tests of 
Basic Skills, and the Lorge-Thorndike 
Intelligence Test, Verbal Battery,
Level 3. The Leiter tests used, starting
with Year VI, were as follows: VI, 2
and 3; VII, 2 and 4; VIII, 2 and 4;
IX, 3 and 4; X, 2 and 3; XI, 1 and 4;
XII, 2 and 4; XIII, 1 and 4; XIV, 1
and 3; XV, 1 and 4; XVI, 3 and 4;
XVII, 2 and 3. Progressive Matrices
and the Leiter scale were administered 
individually. The examiner timed the
response to each item by means of a
stop watch. The only alteration made 
in the directions was the addition of 
the sentence, “Do the items as quickly 
as you can, trying to get all the an- 
swers right.” 

The mean Lorge-Thorndike IQ of
the sample was 109. The SD was 14. 
The mean PM raw score was 27.85. 
The SD was 8.19. The distribution was 
approximately normal. 

In addition to the number right 
score on Progressive Matrices and the 
mental age score on the Leiter scale, a 
time score, based upon the average 
time required to complete the items
done correctly, was computed for each 
pupil. The MA value of each item used 
on the Leiter scale was doubled, in- 
asmuch as only one-half of the items 
was used. Progressive Matrices is 
made up of items of ascending diffi- 
culty. Thus it seemed probable that 
the time scores on some parts of the 
scale might be more significant than 
those on other parts. The scale was 
divided into fourths, numerically, and 
average time scores were computed for 
each pupil on each fourth of the scale, 
making it possible to determine the 
relation between speed of response and 
the criterion measures for each fourth 
of the scale. 
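
The time score described here, the average time on correctly answered items, overall and within each fourth of the scale, can be sketched as follows. This is only an illustration under stated assumptions: the data are invented, the split sizes are passed in rather than taken from the test, and the function names are hypothetical.

```python
from statistics import mean


def average_time_correct(times, correct):
    """Mean response time over correctly answered items (None if none correct)."""
    kept = [t for t, ok in zip(times, correct) if ok]
    return mean(kept) if kept else None


def time_scores_by_section(times, correct, section_sizes):
    """Average time on correct items within each consecutive section of the scale."""
    scores, start = [], 0
    for size in section_sizes:
        end = start + size
        scores.append(average_time_correct(times[start:end], correct[start:end]))
        start = end
    return scores


# Invented record for one pupil: 8 items split into four sections of 2.
times = [4.0, 6.0, 10.0, 12.0, 20.0, 30.0, 40.0, 50.0]   # seconds per item
correct = [True, True, True, False, True, True, False, False]

overall = average_time_correct(times, correct)
by_fourth = time_scores_by_section(times, correct, (2, 2, 2, 2))
print(overall, by_fourth)
```

A pupil with no correct items in a section gets no time score for it, which is one way such a procedure would lose cases on the hardest section of a scale.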

Time allowances on the Iowa Tests 
of Basic Skills are rather liberal, but 
the test does impose time limits. The
number of subjects finishing each
subtest was ascertained. When fewer than
90% of the subjects had finished a
subtest, questions at the end were 
eliminated—back to a point reached 
by 90%. 
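
One way to read the trimming rule above is: keep the longest run of items, counted from the start of the subtest, that at least 90% of the subjects reached. The sketch below implements that reading; the interpretation, the invented data, and the name `items_to_keep` are assumptions rather than the authors' stated procedure.

```python
def items_to_keep(items_reached, n_items, threshold=0.90):
    """Return how many items to score from the start of a subtest.

    `items_reached` holds, for each subject, the number of the last item
    reached. The subtest is cut back until at least `threshold` of the
    subjects reached the final retained item.
    """
    n_subjects = len(items_reached)
    for last_item in range(n_items, 0, -1):
        share = sum(r >= last_item for r in items_reached) / n_subjects
        if share >= threshold:
            return last_item
    return 0


# Invented example: 10 subjects on a 20-item subtest; only 7 finished.
reached = [20, 20, 20, 20, 20, 20, 20, 18, 17, 15]
kept = items_to_keep(reached, n_items=20)
print(kept)

# If every subject finishes, nothing is trimmed.
untrimmed = items_to_keep([20] * 10, n_items=20)
```

The effect is to remove most of the speed component from the criterion scores, so that a relation between time scores and achievement is less likely to reflect shared time pressure alone.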


RESULTS 


As a first step in the analyses of the 
data, zero order correlations were 
computed (a) between average time 
score on the PM and Leiter tests and
ITBS and Lorge-Thorndike scores and
(b) between number right scores on
the PM and Leiter (MAs) and ITBS
and Lorge-Thorndike scores. The results
are reported in Table 1. The correlations
between Progressive Matrices
and the Leiter scale on the one
hand and ITBS and the Lorge-Thorndike
test, on the other, are surprisingly
low. Since in this investigation
the two tests were not administered in
the standard way, the correlations
reported in Table 1 should not be taken
as representative of the predictive
power of the tests.
In order to determine the effectiveness
of combining the average time
scores with the number right scores on
the PM and Leiter tests for the
prediction of the composite achievement
scores and verbal intelligence test
scores, multiple correlation procedures
were utilized. Table 1 gives the multiple
correlation coefficients for the
variables involved. The
ITBS scores and the Lorge-Thorndike
test scores are regarded as the dependent
variables, and average time
scores and number right scores on the
PM and Leiter as the independent
variables. Beta weights were computed
and tested for significance. They are
also reported in Table 1. Combining,
in multiple correlation, time and number
right scores on the PM and Leiter
tests increased these tests’ prediction
on achievement and verbal intelligence
scores to about the same degree. The
beta weights, except that involving the
Leiter time and the Lorge-Thorndike
scores, were significant at the .01 level.
Thus, the addition of time scores to
PM and Leiter number right scores
may be considered to have increased
significantly the correlations between
these two tests and the ITBS and
Lorge-Thorndike scores.
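
For readers who want the arithmetic behind a two-predictor multiple correlation, the standard textbook relations are given below as a sketch. The notation is an assumption introduced here, not the authors': y is a criterion score, 1 is the number right score, and 2 is the average time score.

```latex
% Standardized (beta) weights for two predictors
\beta_1 = \frac{r_{y1} - r_{y2}\, r_{12}}{1 - r_{12}^{2}}, \qquad
\beta_2 = \frac{r_{y2} - r_{y1}\, r_{12}}{1 - r_{12}^{2}}

% Squared multiple correlation
R_{y \cdot 12}^{2} = \beta_1 r_{y1} + \beta_2 r_{y2}
  = \frac{r_{y1}^{2} + r_{y2}^{2} - 2\, r_{y1}\, r_{y2}\, r_{12}}{1 - r_{12}^{2}}
```

Because the result depends on the sign of each zero order correlation, and several signs in the printed tables are hard to read, no attempt is made here to reproduce the reported coefficients.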

The multiple correlation analysis
indicated that making time a part of
the scores increased the predictive
value of both the PM and Leiter tests.
The Leiter test did not lend itself
to further analysis, since the subtests
attempted by the respective subjects
varied. Further analysis was
undertaken on the Progressive Matrices.
To determine whether or not time
on any fourth of the PM was more
significant for predictive purposes than
was time on any other fourth, the PM
was divided into sets of 14 or 15 items
and the average time scores on each
set were correlated with the ITBS and
Lorge-Thorndike scores. The fourths
represent roughly four different levels
of difficulty. Only 74 subjects earned
scores on each of the four sets of items.
The analysis was based on 74
cases. Table 2 presents the zero order
correlations obtained between the average
time scores on each set of items
and the ITBS and Lorge-Thorndike
scores. The only significant correlation
was that between average time
for the third set of items on the PM
and the Lorge-Thorndike test.

Multiple correlations were computed
to determine whether or not
time scores on any fourth of the items
on the PM when combined with the
total number right scores yielded
higher Rs than those of any other

TABLE 1
CORRELATIONS BETWEEN THE RAVEN AND
LEITER TESTS AND ITBS AND LORGE-
THORNDIKE TEST AND BETA WEIGHTS

Variable              ITBS r        Beta          L-T r      Beta
Raven Time            -.17[?]       -.35**        -.173      -.37**
Raven number right    .217          [illegible]   .254       [illegible]
Multiple R            .380                        .420**
Leiter Time           [illegible]   [illegible]   .078       [illegible]
Leiter MA             .291          [illegible]   .357       [illegible]
Multiple R            .310**                      .460

Note. Raven Time vs. Raven Right, r = -.485;
Leiter Time vs. Leiter Right, r = -.750.
 * Significant at .05 level.
** Significant at .01 level.


TABLE 2
ZERO ORDER CORRELATIONS BETWEEN AVERAGE
TIME SCORES FOR EACH FOURTH
OF THE PROGRESSIVE MATRICES AND
ITBS, LORGE-THORNDIKE, AND
TOTAL NUMBER RIGHT ON PM
(N = 74)

Average time                                   ITBS    Lorge-Thorndike
All correct items on the first fourth
  of Progressive Matrices                     -.175       -.282
All correct items on the second fourth
  of Progressive Matrices                     -.178       -.147
All correct items on the third fourth
  of Progressive Matrices                     -.231       -.258*
All correct items on the last fourth
  of Progressive Matrices                     -.104       -.030

Note. Raven Time on first 14 items vs. Raven
Right, r = .157; Raven Time on second 16 items vs.
Raven Right, r = .230; Raven Time on third 15 items
vs. Raven Right, r = -.458**; Raven Time on fourth
15 items vs. Raven Right, r = -.298*.
 * Significant at .05 level.
** Significant at .01 level.

fourth. The results are reported in 
Table 3. Beta weights were computed 
and tested for significance. They are 
also reported in Table 3. 

The Rs involving the time scores of 
the third fourth of items are higher 
than those jnvolving any other fourth. 
One might have predicted that the 
magnitude of the Rs would have in- 
creased with the difficulty of the 
items; i.e., that those of each succeed- 
ing fourth of the items would have 
been greater than those of the
preceding fourth. Indeed a tendency in
this direction was evidenced for the 
first three fourths. It seemed likely 
that the items comprising the last 
fourth of the test would, for these fifth 
grade pupils, be quite difficult and 
that, therefore, the average time 
scores of the various pupils would be 
based upon relatively few correctly 


TABLE 3
MULTIPLE CORRELATIONS INVOLVING EACH
FOURTH OF PM ITEMS

Variable     R          Beta             Beta
6.15        .30*      1 = -.21       5 = [illegible]
6.25        .387**    2 = -.34**     5 = [illegible]
6.35        .45**     3 = -.46**     5 = [illegible]
6.45        .33*      4 = -.30       5 = [illegible]
7.15        .37**     1 = -.28*      5 = [illegible]
7.25        .38**     2 = -.39*      5 = [illegible]
7.35        .52**     3 = -.59**     5 = [illegible]
7.45        .32*      4 = -.29       5 = [illegible]

Note.
1. Average Time on first 14 items on Raven’s PM.
2. Average Time on second 15 items on Raven’s PM.
3. Average Time on third 15 items on Raven’s PM.
4. Average Time on fourth 15 items on Raven’s PM.
5. Number right on complete Raven’s PM.
6. Iowa Tests of Basic Skills.
7. Lorge-Thorndike Intelligence Test.
 * Significant at .05 level.
** Significant at .01 level.


done items. This, if true, should tend 
to lower the reliability of the average 
time scores for this fourth of the test. 
Accordingly, the number of items suc- 
cessfully completed in each fourth of 
the test, and upon which the average 
times were based, was ascertained, 
These, for the successive fourths, were 
found to be 12.7, 7.7, 6.5, and 2.1. 
Thus there is room for doubt that the 
apparent exception to the trend for the 
r’s to increase in magnitude seen at 
the last fourth of the test is genuine, 
When those pupils who got only one 
right or none, were eliminated, the 
mean number right was still only 2.8 
on the last fourth of the test, 

One additional treatment of the data 
for Progressive Matrices was under- 
taken. Median time scores were
computed for each item and one point was
given for each time score which was 
faster than the median score. These 
points for time were then added to the 
number right score on the PM for each 
pupil and a new score obtained. Scores 


thus derived were then correlated with
the criterion measures. Correlations of
.35 and .39 were obtained between the
new PM scores and ITBS and Lorge-Thorndike
scores, respectively, as
compared with .217 and .254 between
the PM number right scores and the
same two criterion measures.

The data suggest that speed of response
when combined with number
right on the PM and Leiter increases
significantly the correlations between
these tests and the ITBS and Lorge-Thorndike
scores.
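
The median-time bonus described above can be sketched in a few lines. Two points the text leaves open are treated here as assumptions and marked in the code: the median for an item is taken over all pupils' recorded times, and a bonus point is given only when the item was also answered correctly. The data and the name `speed_bonus_scores` are invented for illustration.

```python
from statistics import median


def speed_bonus_scores(times, correct):
    """Number right plus one point per item answered faster than the item median.

    times[p][i]   -- time of pupil p on item i
    correct[p][i] -- True if pupil p answered item i correctly
    """
    n_items = len(times[0])
    # Assumption: item medians are computed over every pupil's recorded time.
    medians = [median(row[i] for row in times) for i in range(n_items)]

    scores = []
    for time_row, correct_row in zip(times, correct):
        number_right = sum(correct_row)
        # Assumption: a bonus is earned only on correctly answered items.
        bonus = sum(
            1
            for t, ok, m in zip(time_row, correct_row, medians)
            if ok and t < m
        )
        scores.append(number_right + bonus)
    return scores


# Invented data: 3 pupils, 2 items.
times = [[5, 10], [7, 20], [9, 30]]
correct = [[True, True], [True, False], [False, True]]
new_scores = speed_bonus_scores(times, correct)
print(new_scores)
```

A score built this way weights speed and accuracy equally on each item, which is a cruder combination than the beta-weighted multiple correlation used earlier in the analysis.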
In one recent investigation an average
r of .41 was obtained between Progressive
Matrices scores and various
ITBS subtest scores, on [number illegible]
grade pupils (Knief & Stroud, 1959).
Most reported correlations that we
have seen run somewhat higher than
this. In the present investigation (N
= 102), an r of .217 was obtained
between PM scores (number right)
and ITBS composite scores and an r
of .254 between PM scores and Lorge-Thorndike
verbal scores. Leiter MAs
(based upon one-half of the items)
correlated with these same two criterion
measures to the extent of .291 and
.357. When multiple Rs were computed
between (a) PM number right and
average time scores and (b) Leiter
MA and average time scores and these
same two criterion measures, multiple Rs
of .380 and .420 were obtained for PM
and of .310 and .460 for the Leiter. A
coefficient of -.465 was obtained between
average time values and number
right scores on PM, and of -.750
between Leiter average time values
and MAs.
What the results of this investigation
tend to show is that the speed
with which correct responses are made
on the PM and Leiter tests, when combined
with the conventional scores,
makes a significant contribution to the
predictive power of these two tests, as
signified by increased correlations with
the criterion measures. It is true, of
course, that the correlations are still
low, but the investigation does
indicate the importance of speed of
response in the measurement of intelligence.
Thus the results support the
proposition that a person who can perform
intellectual feats, especially difficult
ones, quickly should be judged to have
more ability than one who does so
slowly.


SUMMARY


To investigate speed of response as
a factor in mental measurement, the
Progressive Matrices and the Leiter
International Performance Scale were
administered in individual sessions to
102 fifth grade pupils, and a time score
was computed for each pupil.
Use of the time values
together with the conventional scores
resulted in correlations between the
two tests in question (PM and Leiter)
and the two criterion tests, the Iowa
Tests of Basic Skills and the Lorge-Thorndike,
significantly higher than
the correlations obtained between the
conventional scores alone and the two
criterion tests. Further analysis of the
PM data suggested that the contribution
of the time values increased from
fourth to fourth of the items, and thus
with difficulty, through the third
fourth.

The results imply that the person
who is able to deal more quickly with
intelligence test items has more ability
than the slower, but equally accurate
individual.
REFERENCES


GILBERT, J. A. Researches on the mental and
  physical development of school children.
  Stud. Yale Psychol. Lab., 1894, 2, 40-100.

KNIEF, L. M., & STROUD, J. B. Intercorrelations
  among various intelligence,
  achievement, and social class scores. J.
  educ. Psychol., 1959, 50, 117-120.

KUHLMANN, F. Tests of mental development.
  Minneapolis: Educational Test Bureau,
  1939.

TINKER, M. A. The significance of speed in
  test response. Psychol. Rev., 1931, 38,
  450-454.

WISSLER, C. The correlation of mental and
  physical tests. Psychol. Monogr., 1901,
  3(6, Whole No. 16).


(Received March 15, 1961) 


Journal of Educational Psychology 
1961, Vol. 52, No. 5, 254-261 


EFFECTS OF TRAINING AND ANXIETY UPON
TEACHERS’ PREFERENCES FOR INFORMATION
ABOUT STUDENTS¹


PHILIP J. RUNKEL AND DORA E. DAMRIN
Bureau of Educational Research, University of Illinois 


A characteristic of the individual’s 
perceptual or cognitive space which 
captures much attention in current lit- 
erature—e.g., Bieri (1955), Bieri and 
Blacker (1956), Leventhal (1957), 
Lundy and Berkowitz (1957)—is the 
“complexity-simplicity” of the frame 
of reference within which the individ- 
ual perceives environmental objects. 
Complexity-simplicity can be con- 
ceived as the dimensionality of the 
cognitive space. By dimensionality we 
do not refer to the nature or content 
of the dimensions mediating the re- 
sponse, but only to the sheer number 
of the dimensions used by the indi- 
vidual in discriminating environmen- 
tal objects. This paper examines two 
hypotheses: (a) that training is re- 
lated to cognitive dimensionality, and 
(b) so also is anxiety. The tests of 
these hypotheses were set in the con- 
text of teachers’ thinking about prob- 
lems of students. 


Dimensionality and Knowledge 


Let us consider a couple of common
notions about training or knowledge.
On the one hand, it is widely held that
training serves to rule out irrelevan-
cies. By focusing attention upon what
is relevant and important, training re-
stricts the number of criteria by which
matters are judged. On the other hand
a proposition dear to educators is that
training is liberating; it provides the
individual with an enhanced reper-
toire of criteria from which he can


¹The research reported herein was per-
formed pursuant to a contract with the
Office of Education, United States Depart-
ment of Health, Education, and Welfare,




select the most appropriate to guide
his judgment. Whereas the untrained
individual is more prone to adopt an
irrelevant dimension or one given by
his mood of the moment, the individ-
ual with sufficient training possesses a
kit bag of relevant dimensions and can
judiciously select from it dimensions
suitable in different situations. From
this point of view training increases
the number of dimensions which, at
any given moment, the individual can
draw upon.
These two ideas are not contradic-
tory. The first says that the untrained
person’s criteria or dimensions of judg-
ment depend on his mood of the mo-
ment or upon surface irrelevancies of
the situation, so that in meeting a
series of situations he utilizes a large
number of dimensions. Although he
might react on the basis only of one or
very few dimensions in any one situa-
tion, his dimensions over many situa-
tions would be numerous. The second
phase begins when the individual ac-
quires a little knowledge. He then
looks for certain relevant and import-
ant characteristics no matter what the
situation, so that the tally of dimen-
sions he uses over a series of situations
is small. Then with further training,
the person comes to recognize more
criteria as relevant to each situation,
selecting certain ones as more impor-
tant in some situations and others as
more important in others; conse-
quently the dimensionality of his judg-
ments over situations increases.
These ideas would lead one to pre-
dict that the number of dimensions
likely to be used in a single situation
would increase with training. Over a
series of situations, however, dimen-
sionality would first decrease and then
increase as knowledge was acquired.
The particular prediction made in 
terms of our data will be better under- 
stood if we first describe our instru- 
ments and method. 


METHOD


Teachers in 28 secondary schools in Illi-
nois were given a questionnaire describing
12 imaginary problems which a teacher
might encounter with students. For each
problem the teacher was asked to express
his preference among six types of informa-
tion which he might seek in order to deal
with the problem. Problem 1,
along with a condensation of the instructions
to respondents, looked like this:


Problem 1 


... McCauley has not turned in a single
homework assignment since the beginning of the
term. You have warned her that failure to
do her homework will result in her
failing the course, but this has no effect.
She sits and daydreams most of the time
she is in class.

As a working guess, where would you say
the source of this difficulty might lie? To
check up on your working guess, what
additional kinds of information would you
seek? Please number, in order, the types
of information (no more than three).




To illustrate further, three more of the prob-
lems follow:

Virginia Smith’s parents want their
daughter to go to college, and Virginia
herself wants very much to go. However,
she ... to major in.

... has been sent to the principal’s
office by ... different teachers for his
... behavior in their classes. His ... in
each instance is, “The teacher started it.”

... teachers report that the boy is always
so tired ... is always falling asleep in
their classes. The school physician reports
that he is basically in good health.


Each teacher chose three of the types of 
information and ordered them 1-2-3 accord- 
ing to his preference, and he did this 12 
times, once for each problem. Using the 12 
rankings given by a teacher (one ranking of 
three things for each of the 12 problems), it 
was possible to attempt to “unfold” them 
into one dimension according to the tech- 
nique of Coombs (1953). 

Unidimensional unfolding is a method for 
ascertaining whether an obtained rank order 
(or a set of rank orders) might have been 
generated from an underlying order. For ex- 
ample, suppose that a man with a wife and 
two children finds that six houses are avail- 
able on the market, one with a combination 
living and sleeping room, another with one 
bedroom, another with two bedrooms, and 
so on up to five bedrooms. He might most 
prefer the house with three bedrooms, next 
the one with four, and thirdly the one with 
five. When he becomes the father of a third 

child, he might then most prefer the house 
with four bedrooms, then the one with five, 
and thirdly the one with three bedrooms. 
Both these rankings of three of the houses 
out of six could have been generated from 
the underlying dimension of the number of 
bedrooms in the houses. But if a man most 
preferred the house with, say, two bedrooms, 
he could not (on the basis of the number of 
bedrooms alone) next prefer the house with 
five—his next choice would have to be either 
the house with three bedrooms or the house
with one. Thus some responses in rank order 
form are “unfoldable” into the number of 
bedrooms dimension and some are not. A 
more extended discussion of unfolding can 
be found in Coombs (1953). 
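
The house example can be sketched in a few lines of code. The sketch below is only an ordinal illustration and not Coombs' full procedure: it assumes that a ranking unfolds into an underlying order when each successive choice lies immediately beside the block of items already chosen. The function name `unfolds` and the integer coding of the houses are our own illustrative choices.

```python
def unfolds(ranking, scale):
    """Return True if `ranking` (most preferred first) could arise from `scale`.

    Ordinal check only: every choice after the first must sit directly next
    to the block of items already chosen along the underlying order.
    """
    position = {item: i for i, item in enumerate(scale)}
    low = high = position[ranking[0]]
    for item in ranking[1:]:
        p = position[item]
        if p == low - 1:
            low = p          # block grows to the left
        elif p == high + 1:
            high = p         # block grows to the right
        else:
            return False     # choice skips over an unchosen item
    return True


# Six houses coded by number of bedrooms (0 = combination living/sleeping room).
bedrooms = [0, 1, 2, 3, 4, 5]

print(unfolds([3, 4, 5], bedrooms))  # family with two children
print(unfolds([4, 5, 3], bedrooms))  # same family after a third child
print(unfolds([2, 5, 3], bedrooms))  # two bedrooms first, then five
```

The first two rankings pass the check and the third fails, matching the verbal example: after choosing the two-bedroom house, the next choice would have to be the house with three bedrooms or the house with one.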

An interpretation of unfolding in terms 
of the materials used in the present study 
might be illustrated as follows. Suppose 
that a teacher judged student problems in 
terms of what we might loosely call a 
“hard-soft” dimension. For one problem the 
teacher might feel that very “hard” infor- 
mation was most suitable, choosing first in- 
telligence, then achievement, then interests, 
then family life, etc. On another problem
the teacher might feel that an intermediate 
type of information was most suitable, 
choosing first interests, then achievement, 
then family life, then intelligence, etc. Both 
these preference orders could unfold into the 
same dimension, as will be seen below. 

The unfolding technique does not, of 
course, depend on guessing what basis the




TABLE 1 
ILLUSTRATION OF HOW THE RESPONSES OF
ONE SUBJECT TO THE 12 PROBLEMS
FITTED INTO AN UNFOLDING SET


Possible preference orders 
(each column shows the 
response to some problem)


Type of information 


Intelligence 

Interests and ambitions 

Achievement in school sub- 
jects 

Personality and character 

Family and home life 

Fears and worries 


Number of problems to 12 11 Lilt 
which subject gave response
shown in columnᵃ


ᵃ Responses of the subject which would not unfold
with the other nine in this set were: interests-family-
fears, family-interests-personality, and fears-intelli-
gence-interests.


subject might be using in organizing his 
preferences. The technique is entirely ob- 
jective. As an illustration, the responses of 
one of the subjects are shown in Table 1. 
The 1-2-3 entries in the body of the table 
show one set of preference-orders which 
could be generated by the “underlying” 
order of information types listed at the left. 
For any given underlying order of the in- 
formation types, and under the instruction 
to order three out of the six, there are eight 
sets (the table shows only one) of 13 re- 
sponse patterns, each being a set which 
could be generated by the underlying order 
when the respondent chooses, in order, the 
types nearest his “ideal” type. Eight sets 
are possible because of the effects of relative 
distances among the types; this, as well as
other details of the unfolding technique, is 
explained by Coombs (1953). 

The response patterns given by our il- 
lustrative subject are indicated at the bot- 
tom of the table. The particular unfolding 
set represented in the table accounted for 
nine of the subject’s responses, and this was 
the most that any unfolding set would ac-
commodate.

Whenever one of a teacher’s 12 rankings
failed to unfold with his others, we knew
that at least two dimensions were required
to encompass the 12 responses. We then
went on to assume that the fewer of the
teacher’s 12 responses that would unfold to-
gether, the higher his dimensionality was
likely to be. This would not necessarily be
true, of course. But we assumed that it was
more likely to be true than not.

The maximum number of responses un-
foldable into a single dimension was taken
directly as the individual’s dimensionality
“score.” Since there were 12 problems, the
dimensionality score could range up to
12. A large number of unfoldable responses
means low dimensionality, and vice versa.
Actually, the obtained scores ranged from ...
(high dimensionality) to 10 (low dimension-
ality). If a teacher gave an unusable re-
sponse to more than one problem, he was
coded NA and not given a dimensionality
score.

The unfolding technique thus gave a
rough index of whether the teacher was
using relatively few or relatively many di-
mensions in thinking about the types of in-
formation he would need in dealing with
problems described to him.
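
The scoring rule just described can be sketched as a brute-force search. This is a simplified illustration rather than the procedure actually used: it applies only the ordinal adjacency condition and ignores the metric considerations that produce Coombs' eight sets per underlying order. The names `unfolds` and `dimensionality_score`, and the four rankings, are hypothetical.

```python
from itertools import permutations


def unfolds(ranking, scale):
    """Ordinal check: each later choice must adjoin the block already chosen."""
    position = {item: i for i, item in enumerate(scale)}
    low = high = position[ranking[0]]
    for item in ranking[1:]:
        p = position[item]
        if p == low - 1:
            low = p
        elif p == high + 1:
            high = p
        else:
            return False
    return True


def dimensionality_score(rankings, items):
    """Largest number of rankings that unfold into any one ordering of `items`.

    A large score corresponds to LOW dimensionality, as in the text.
    """
    best = 0
    for scale in permutations(items):  # 720 orderings for six items
        fitted = sum(unfolds(r, scale) for r in rankings)
        best = max(best, fitted)
    return best


items = ["intelligence", "interests", "achievement",
         "personality", "family", "fears"]

# Hypothetical rankings from one respondent (not data from the study).
rankings = [
    ("intelligence", "achievement", "interests"),
    ("intelligence", "interests", "achievement"),
    ("intelligence", "personality", "family"),
    ("interests", "intelligence", "achievement"),
]

print(dimensionality_score(rankings, items))
```

In this made-up case three of the four rankings can be fitted to a single order, but not all four, because the first three would each require a different type to stand next to intelligence, and an item on a line has only two neighbors.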
As a measure of training we
used a 60-item test developed in the course
of our larger study: a test of Knowledge
and Interpretation of Tests (KIT). We did
not want a measure of years of experience
in dealing with information about students,
since, as it is often phrased, 10 years’ ex-
perience often consists of 1 year’s experience
repeated 10 times over. Nor did we want a
direct measure of amount of exposure to
training, since there are great individual dif-
ferences in the degree to which training
“takes,” especially when assessed a year or
more after the last exposure to training.
Consequently, we wanted what might be
called a measure of the residual effects of
training.

Some of the types of information offered
the respondent were types typically ob-
tained through the use of tests. This cir-
cumstance, together with the fact that train-
ing in the use of information about students
usually includes training in constructing and
interpreting tests, persuaded us that for the
purposes of our hypothesis a measure of
knowledge about testing would suffice to
index the extent to which the individual’s
training would affect his tendencies in the
use of information about students. Conse-
quently, scores on the KIT were distributed
against the dimensionality scores to test the
hypothesis that increased training is asso-
ciated with different dimensionalities. Some
representative items from the KIT are given
below.

In counseling junior high school stu-
dents a teacher should be most cautious
in her interpretation of the information
she obtains from:

A. vocational interest tests
B. ... skills tests
C. reading comprehension ...
D. performance tests


A student scored at the 75th percentile
on a standardized achievement test. This
means that:

A. 75% of the norm group scored lower
than he did
B. 15% of the norm group scored higher
than he did
C. he correctly answered 75% of the questions
D. he is in the upper 25% of his own
high school ...


On a ... intelligence test, which pu-
pil would be most apt to receive an IQ
that is ... lower than his true ability?

A. ..., whose parents speak only
Spanish
B. John, who ... in all of his school
subjects
C. Jane, who comes from a lower mid-
dle class ...
D. Bill, who spends all of his time in
athletics instead of on his school work


The question arises what kind of relation
should be expected between our two meas-
ures: dimensionality (number of mutually
unfoldable responses) and training (score on
the KIT). Can we specify precisely where,
within the range covered by our measures,
dimensionality should increase or decrease?
We mentioned earlier that the predictions
would be different if we were looking within
a single situation or if we were looking at a
series of situations. Was answering the Prob-
lems Questionnaire a single situation for our
subjects or was it a series of 12 situations?
We could hardly know in advance.

Another difficulty in choosing predic-
tions was the range of the KIT. There was
no way of determining in advance whether
it extended high enough to reach the hy-
pothesized point where dimensionality
would begin to rise. Because of these un-
certainties, we could not pick out a precise
form of relation to predict. If the question-
naire were a single situation and the range
of the KIT were too narrow, we could expect
no relation at all to show up. If the ques-
tionnaire were a series of situations and the
range of KIT were moderate, we could
expect dimensionality to decrease with
knowledge. But if the KIT were ample in
range, we could expect first a decrease in
dimensionality and then an increase. We
hoped that our instruments were adequate
and adopted the latter relation as our Hy-
pothesis 1: Dimensionality in responses to
the Problems Questionnaire will first de-
crease and then increase with score on the
KIT.


RESULTS 


Hypothesis 1 

Dividing the KIT scores into six in- 
tervals containing as nearly equal 
numbers of respondents as possible 
and entering the dimensionality scores 
for each KIT interval, we obtained the 
results shown in Table 2; males and 
females are tabulated separately. 

The means shown in Table 2 are
plotted graphically in Figure 1. Both 


TABLE 2 
FREQUENCIES OF DIMENSIONALITY SCORES
AT SUCCESSIVE KIT LEVELS
KIT scores 
puns only ——————— 7 
jimensiona) ty 
14-| 24- | 27-| 30-] 33-| 57- 
M5 | 26 | 29 | 32 | 36 | 55 Total 
a fecal 
High 4 a) 19), a 
5 8 }10 38 
; of6| 48 
2{5 | 32 
i 3 |2 16 
Low 9 é Z 
a eee 
Ae log [go | 157 
* 5.8] 5.8 
gee oY 
KIT scores 
sRemales cai ——— oo eee ee 
imensionality | yq_| 23-| 27-] 30- | 32- | 35- 
10- | 25-175 | st | 34 | 46 | TO 
High 3 1 bs 
4 2/5 2]3]1 13 
5 9/5 |/s {6/5 |6 34 
6 3 fio |e | 7 a [32 49 
7 af3js|4]2|4] 2 
8 1 2|4|4 [5 16 
9 2 2 
Low 10 2 }1 3 
Totals 18 |23 |21 |25 |26 |28 
M41 
M 5.3| 5.5] 6.7) 6.4] 6.1] 6.3 


Note.—Chi square tests from tables collapsed to
3 × 3: Males χ² = 11.13, p < .026;
Females χ² = 15.80, p <


FIG. 1. Mean dimensionality at successive KIT levels, plotted
separately for males and females (vertical axis: mean dimensionality,
from high to low; horizontal axis: KIT scores).


curves dip in the middle with a grati-
fyingly high level of statistical con-
fidence, and their general shape is rep-
licated for males and females. Our
theory suggests that the high dimen-
sionality of the low KIT scorers rep-
resents a haphazard hopping from one
dimension to another in going from
problem to problem, whereas the mod-
erate to high dimensionality of the
high KIT scorers—so let us flatter
ourselves—is the differentiated cogni-
tive structure of the expert. The rela-
tively low dimensionality of respond-
ents with average knowledge might be
called the simple sophomoric world of
the semi-educated.


Anxiety and Dimensionality


There is evidence in the literature
that anxiety focuses or narrows the as-
pects of the environment to which the
organism responds. Easterbrook (1959)
is particularly cogent on this point.
Now, one way in which perception or
attention can be focused is by reducing
the number of dimensions along which
environmental objects are discrimi-
nated; and our dimensionality measure
is meant to indicate, roughly, just this:
that the types of information pre-
sented to the teacher were discrimi-
nated along more or fewer dimensions.
We should therefore predict (Hypoth-
esis 2) that teachers with higher anx-
iety will show lower dimensionality
than those with lower anxiety when
responding to the problems.

Our measure of anxiety was the
“Self-Perception Checklist” which
formed part of a questionnaire used in
our larger study. It was made up of
selected items from the Guilford-Zim-
merman Temperament Survey (1949)²
and from Saunders’ Personality Re-
search Inventory (1955). After the re-
turns were in, these items were ana-
lyzed for homogeneity. Thirteen
proved to have tetrachoric r’s with
total score of .40 or better, and these
were used to make up the total score
on anxiety. This score was merely the
count of items answered according to
the key for “security” as opposed to
anxiety, high scores indicating security
and low scores anxiety, the two
terms being meant to label a polarity.
This short test was admittedly an un-
reliable measure and subject to var-
ious biases such as social desirability
of answers. It was, however, the best
measure we could make at the time
within the limits of administration, respon-
dent fatigue, etc. Some examples of
items are these:

... get into a state of tension ... by thinking over the day’s
happenings?
... usually been optimistic about your future?
... criticize people without ... hurt ...


² Used by permission. We used items from
the Guilford-Zimmerman ... which had been
selected ... by Jay M. Jackson.


Hypothesis 2

When we distributed the security
scores against dimensionality with all
teachers pooled the result was about
as near random as one could get. How-
ever, since neither of these measures
was expressly designed to give a pow-
erful test of the hypothesis we decided
to look further for ways of selecting




respondents for whom the security and 
dimensionality measures would be 
more discriminating. 

Other work carried out by Gerald 
Larson (unpublished) as a part of the 
larger study had suggested that our 
security measure was more valid for 
males than for females. Also, as Table
2 shows, the KIT was not only asso-
ciated with differences in dimension-
ality, but the spread of dimensionality
scores was somewhat greater for the 
middle group of KIT scores. Accord- 
ingly, we distributed security against 
dimensionality for each of the six 
groups formed by sex and three levels 
of KIT. The results for males are 
shown in Table 3; the results for fe- 
males were nowhere significant, and 
are not reported. 

For males, results remained random 
at low and high levels of KIT but be- 
came significant in the middle range. 
In brief, no subgroup of respondents 
showed a relation contrary to hypoth- 
esis; and one subgroup, selected for 
sensitivity to our unreliable measures, 
showed the predicted relation at a 
good level of confidence. As the center 
subtable of Table 3 shows, relatively 
more of the secure male teachers ex-
hibited high dimensionality and rel-
atively more of the insecure showed
low dimensionality.
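
As a rough check on the center subtable of Table 3 (KIT of 26-35), the Pearson chi-square can be recomputed from the printed cell frequencies. The sketch below is not the authors' own computation: the function name `chi_square` is ours, no continuity correction is applied, and the frequencies are taken exactly as they appear in the table.

```python
def chi_square(table):
    """Pearson chi-square statistic for a two-way frequency table (list of rows)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected frequency under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Rows: dimensionality high (4-5), middle (6), low (7-9).
# Columns: security low (0-10), security high (11-13).
middle_kit = [[7, 11],
              [4, 15],
              [20, 12]]

print(round(chi_square(middle_kit), 2))
```

With the cell counts as printed, the statistic comes out a little above 8.6 on 2 degrees of freedom, close to the value given under the subtable.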


Security vs. KIT 

Because both knowledge and anxiety 
showed some relationship to dimen- 
sionality we investigated their relation- 
ship with each other. Only random re- 
sults were obtained for females, but for 
males the relationship was monotonic.
This result is reported in Table 4,
where it is shown that more knowl-
edgeable males tend to be more secure
than do less knowledgeable ones.
Whether the more secure males were
those who took courses in testing and
learned more from them or whether

TABLE 3
DIMENSIONALITY VS. SECURITY WITHIN
KIT LEVELS
Males Only

                        Security
Dimensionality     Low       High      Total
                  (0-10)    (11-13)

KIT of 10-25
High 4-5            10         5         15
     6               7         7         14
Low  7-9             5         5         10
Total               22        17         39
χ² corrected for continuity = 0.32; ns.

KIT of 26-35
High 4-5             7        11         18
     6               4        15         19
Low  7-9            20        12         32
Total               31        38         69
χ² = 8.54, p < .01 (one-tailed). Itali-
cized cell frequencies are higher than
randomly expected.

KIT of 36-54
High 4-5             4        10         14
     6               5         5         10
Low  7-9             2         9         11
Total               11        24         35
χ² corrected for continuity = 1.33; ns.


they became more secure after acquir- 
ing more knowledge about testing is, 
of course, undetermined. 


Summary 


This paper is concerned with the 
effect of knowledge and of anxiety 
upon certain aspects of cognitive di- 
mensionality. Dimensionality is here 
defined as the total number of dimen- 
sions or preference factors a person 
employs in perceiving and evaluating 
specified complexities of environmen- 
tal stimuli. Individuals with high di-
mensionality employ a sizable number
of dimensions in dealing with such 
stimuli; those with low dimensionality 
employ relatively few. 

In this research the stimuli con- 
sisted of six general kinds of informa- 
tion a teacher might select for use in 
working with students who presented 
problems typical of those frequently 
encountered in the classroom. Our aim 
was to investigate whether differences 
in teachers’ knowledge about such 
matters, and also whether differences 
in teachers’ personal feelings of secu- 
rity, affected cognitive dimensionality 
in ways consistent with those theorized 
in current literature. 

Hypotheses long held by educators
combine to assert that training, by
serving to increase knowledge, at first
restricts and later enlarges the indi-
vidual’s cognitive framework. The un-
trained person, having no established
framework, gives responses primarily
determined by the situation. Training
first serves to decrease this variability
by providing a few specific ways of
perceiving these stimuli regardless of
the situation, and later serves to en-
rich cognitive dimensionality by pro-
viding the person with knowledge that
equips him with more criteria for dis-
criminating the stimuli. The signifi-
cant curvilinear relationship found be-
tween teachers’ knowledge of testing
and teachers’ perceptions of student
problems strongly supported this hy-
pothesis. Teachers with low and high
levels of knowledge possessed high di-
mensionality: a finding we interpreted
as indicating erratic versus specialized
and differentiated perceptions. By
contrast, teachers with average knowl-
edge possessed relatively low dimen-
sionality, which we interpreted as in-
dicative of the restricted simplicity of
the partially trained individual.


TABLE 4
SECURITY VS. KIT
Males Only
Security
KIT scores |——___ Total
OF BW fp ag
36
ae 5 11 9 8s} 35


22 27 33 38 93] 143


Note.—Critical ratio by J. E. K. Smith’s (1956)
monotonicity test = ...355, p < .02 (two-tailed).


Another body of theory states that
feelings of anxiety serve to narrow
perception to a few elements of the
environment. We reasoned that, if
true, such narrowing would be re-
flected in a reduction of the number of
dimensions the anxious individual em-
ploys in dealing with his environment.
This hypothesis was supported by a
part of our data. No relation between
anxiety and dimensionality was found
for female teachers, but a significant
one was found for male teachers who
possessed knowledge in the middle or
average range. We believe that more
reliable measures of anxiety and of
dimensionality might provide stronger
support for this second hypothesis.




REFERENCES 

Bieri, J. Cognitive complexity-simplicity
and predictive behavior. J. abnorm. soc.
Psychol., 1955, 51, 263-268.

Bieri, J., & Blacker, E. The generality of
cognitive complexity in the perception
of people and inkblots. J. abnorm. soc.
Psychol., 1956, 53, 112-117.

Coombs, C. H. Theory and methods of social
measurement. In L. Festinger & D.
Katz (Eds.), Research methods in the
behavioral sciences. New York: Dryden,
1953. Pp. 471-535.

Easterbrook, J. A. The effect of emotion on
cue utilization and the organization of
behavior. Psychol. Rev., 1959, 66, 183-
201.

Guilford, J. P., & Zimmerman, W. Guilford-
Zimmerman Temperament Survey. Bev-
erly Hills, Calif.: Sheridan Supply, 1949.

Leventhal, H. Cognitive processes and in-
terpersonal predictions. J. abnorm. soc.
Psychol., 1957, 55, 176-180.

Lundy, R. M., & Berkowitz, L. Cognitive
complexity and assimilative projection
in attitude change. J. abnorm. soc. Psy-
chol., 1957, 55, 34-37.

Saunders, D. R. The Personality Research
Inventory. Princeton: Educational
Testing Service, 1955.

Smith, J. E. K. On the analysis of contin-
gency tables with ordered classifications.
Lexington, Mass.: Massachusetts Insti-
tute of Technology, Lincoln Laboratory,
1956.


(Received March 29, 1961) 


Journal of Educational Psychology 
1961, Vol. 52, No. 5, 262-265 


LEARNING IN FOURTH AND SIXTH GRADERS AS
A FUNCTION OF SENSORY MODE OF STIMULUS
PRESENTATION AND OVERT OR
COVERT PRACTICE¹


JOAN LOCKARD
University of Wisconsin

AND

JOSEPH B. SIDOWSKI
San Diego State College


A number of studies have indicated
a relatively consistent trend for sub-
jects to learn more efficiently when
material is presented simultaneously
to two sensory channels (auditory +
visual) than when a single sensory in-
put is used (Day & Beach, 1950).
When auditory versus visual modal-
ities are considered, however, results
are conflicting. Interpretation of the
results is further complicated by the
interaction of sensory input variables
with other variables. Early studies in-
dicate that auditory presentation of
material is more effective than visual
for relatively young children (third,
fourth, and fifth grades) but that the
trend reverses as age increases (Erick-
son & King, 1917; Russell, 1938). Sev-
eral recent studies (Evans, Glaser, &
Homme, 1960; Silverman & Alter,
1960), investigating response modes in
automated instruction with college
students, have suggested that the re-
quirement to write a response may be
less effective in learning than merely
reading the same material. The pres-
ent study, therefore, was designed in
order to investigate the influence of
three modes of stimulus presentation
(auditory, visual, and auditory + vis-
ual) and two modes of responding
(overt or covert) upon learning in
fourth and sixth grade children.


¹The authors wish to express their sincere
appreciation to Robert Nardelli, Principal of
the Campus Laboratory School, San Diego
State College, for his cooperation during the
course of the experiment.


METHOD


Subjects. Eighteen fourth and eighteen sixth
grade children from the Campus Laboratory
School at San Diego State College served as
subjects. The mean IQ (California Test of
Mental Maturity) for the fourth grade was
119.7, and for the sixth grade, 124.6.

Procedure. Students in both grades were
required to learn lists of nonsense syllables
under each of three sensory modes of stimu-
lus presentation: visual, auditory, and visual
+ auditory. The subjects in each grade
level were randomly assigned to two response
subgroups, overt or covert, consisting of nine
subjects each. In the overt response condi-
tion, subjects were required to recall and
write down the syllables after each of the
trials. The covert groups merely ... the
syllables and were given a written recall test
only after the fourteenth trial. A trial was
defined as one complete presentation of the
list of 10 syllables.

Five lists of nonsense syllables, two prac-
tice and three experimental, taken from Glaze
(1928) and of high association value (...
90%) were used. In order to keep
similarity at a minimum, a different con-
sonant started each of the syllables in a list
and vowels were used about equally often.
Each list consisted of 10 syllables. An earlier
study by Sidowski and Bason (1960) indi-
cated no significant difference in difficulty
between the lists. Each list was presented for
a total of 14 trials in each condition. Syllables
were presented at a 2-second rate with a ...-
second interval intervening between the pres-
entation of each syllable. The syllables were
spelled out during the 2 seconds for the audi-
tory conditions by means of a Webcor tape
recorder. Visual stimuli were presented at eye
level by an American Optical Performance
...

... syllables that he remembered during ... in-
terval on each of the 14 trials, ...
covert groups merely sat and waited for the
next trial. ... subjects were instructed to re-
member as many of the syllables as possible
since they would be tested after going
through the list 14 times. After Trial 14, sub-
jects in all groups were allowed as much time
as was needed to recall and write down the
syllables.

Design. Each subject served in the experi-
ment once per day for 4 consecutive
days. The first day was a practice day during
which the subject learned one practice list
auditorily and one visually. On each of
the following three days, learning was carried out
under one of the three experimental sensory
conditions. The order of presentation of the sensory
variables and the three experimental lists
were counterbalanced within the design, as
were the presentations of the practice condi-
tions.

At the conclusion of the experiment, the
subject was asked to rank the three sen-
sory modes from the least preferred to the
most preferred.

RESULTS AND DISCUSSION


The mean number of errors in recall
are presented in Table 1 for all con-
ditions. (An error was defined as an
incorrect response as well as a failure to
recall.) An inspection of the data in-
dicates a consistent superiority for the
visual and visual + auditory presen-
tations in the overt practice condi-
tion. This superiority is consistent for
both grade levels. The covert practice
condition appears superior for the
sixth graders with little differentiation
at this grade level between the sensory
modes.

An analysis of variance was carried
out on Trial 14 since recall was re-
quired of all groups during this
trial. The results of the analysis are
presented in Table 2. A significant
overall difference was found between
the sensory modes of stimulus presen-
tation, while the Grades × Response
Type interaction showed an F of 4.11,
with an F of 4.15 being required at the
5% level. Analysis of the sensory
mode variable indicated that the au-
ditory presentation of stimuli was sig-
nificantly inferior in recall to visual
but not significantly different from au-
ditory + visual.


TABLE 1
MEAN NUMBER OF ERRORS IN RECALL FOR
OVERT AND COVERT RESPONSE CONDITIONS

                            Response groups
                          Overt             Covert
Stimuli              Trial 1   Trial 14    Trial 14

Grade 4:
  Auditory             8.2       2.8         3.3
  Visual               7.3       0.8         2.1
  Auditory + Visual    7.3       2.0         2.3
Grade 6:
  Auditory             8.3       3.2         0.07
  Visual               6.6       1.2         0.07
  Auditory + Visual    7.1       1.5         0.07
It was decided beforehand to carry 


out two additional statistical analyses. 
A separate analysis was carried out 
on the covert response groups and the 
results indicated: (a) fourth graders 
producing significantly more errors 
than sixth graders, (b) no significant 
difference between sensory modes for 
the sixth grade, and (c) for the fourth 
grade, auditory presentation produc- 
ing significantly more errors (p < .05) 
than visual. A final analysis of vari- 
ance was carried out on Trial 1 for 
the overt response condition. The re- 
sults indicated a significant sensory 
mode effect (F = 7.95, df = 2/32, p 
< .01) with auditory presentation 
showing significantly more errors than 
either of the other sensory modes for 
both grade levels. 

It is obvious that the results of the present study do not agree with
those of earlier studies (Erickson & King, 1917; Russell, 1938) which
indicate a superiority for the auditory method for younger children.
In the earlier studies, however: (a) more meaningful material was
used, (b) the auditory presentation of material consisted of class
lecture and/or reading by a status figure, and (c) less stringent
controls were exercised over the experimental environment. One of the
advantages inherent in the use of the visual method for presenting
nonsense syllables is that the letters for any three letter syllable
are presented simultaneously. With the auditory method, each of the
letters is spelled serially, thus requiring the subject to organize
the letters in the syllable while learning (Krawiec, 1946). Evidently
the covert condition allowed the sixth graders to learn and organize
the material better than the overt method for all modes of stimulus
presentation. This was not true for the fourth graders, although the
younger children had little difficulty in recalling the material as
well as the sixth graders when written responses were required after
each trial.

TABLE 2
ANALYSIS OF VARIANCE FOR ERRORS IN RECALL ON TRIAL 14

Source                     df       MS        F
Grades (G)                  1     22.23     2.83
Response types (RT)         1      2.09
G X RT                      1     32.23     4.11
Subjects within groups     32      7.85
Sensory modes (SM)          2     15.87     9.12*
SM X G                      2      0.84
SM X RT                     2      5.08     2.92
SM X RT X G                 2      2.07     1.19
Error                      64      1.74
Total                     107

* p = .01.
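The F ratios reported in Table 2 can be checked against the tabled mean squares. The sketch below assumes the usual mixed-design convention, which the article does not state explicitly: between-subjects effects (Grades, G X RT) are tested against the "Subjects within groups" mean square, and effects involving sensory modes are tested against the "Error" mean square. The dictionary layout and function names are illustrative only.

```python
# Mean squares transcribed from Table 2 (analysis of variance, Trial 14).
BETWEEN_ERROR_MS = 7.85  # "Subjects within groups", df = 32
WITHIN_ERROR_MS = 1.74   # "Error", df = 64

# effect -> (mean square, error mean square assumed as its denominator)
EFFECTS = {
    "Grades (G)": (22.23, BETWEEN_ERROR_MS),
    "G X RT": (32.23, BETWEEN_ERROR_MS),
    "Sensory modes (SM)": (15.87, WITHIN_ERROR_MS),
    "SM X RT": (5.08, WITHIN_ERROR_MS),
    "SM X RT X G": (2.07, WITHIN_ERROR_MS),
}

# Degrees of freedom for every source listed in Table 2, in table order.
DEGREES_OF_FREEDOM = [1, 1, 1, 32, 2, 2, 2, 2, 64]


def f_ratio(ms_effect, ms_error):
    """Return the F ratio: effect mean square over error mean square."""
    return ms_effect / ms_error


def f_table():
    """Recompute each tabled F, rounded to two decimals as in the article."""
    return {name: round(f_ratio(ms, err), 2) for name, (ms, err) in EFFECTS.items()}


if __name__ == "__main__":
    for name, f in f_table().items():
        print(f"{name:20s} F = {f:.2f}")
    print("total df =", sum(DEGREES_OF_FREEDOM))
```

Under these assumptions the recomputed ratios agree with the tabled values, and the component degrees of freedom sum to the tabled total of 107.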

The results of the preference rankings of the three sensory modes
paralleled those of the error score analyses. The visual and
auditory + visual modes were preferred significantly more than the
auditory mode, with little preferential difference being indicated
between the two visual conditions.


SUMMARY


The learning of nonsense syllables by fourth and sixth graders was
investigated as a function of: (a) three modes of stimulus
presentation (auditory, visual, and auditory + visual), and (b) two
response types: overt (written) and covert.

All subjects were required to learn a list of nonsense syllables under
each sensory mode of stimulus presentation. Each list was presented
for 14 trials under all conditions. One group of subjects in each
grade wrote out as many of the syllables as possible after each of the
14 trials (overt group). The remaining subjects merely observed and
learned the syllables for 14 trials with no written recall responses
(covert). After the 14 trials, subjects in all groups were given as
much time as was needed to recall and write out the syllables.

Analyses of the data indicated a significant superiority for the
visual and auditory + visual methods of stimulus presentation. No
significant difference was found between the fourth and sixth graders
for the overt response condition. For the covert groups, however, the
fourth graders recalled significantly fewer syllables than the sixth
graders.


REFERENCES 


Day, W. F., & Beach, B. R. A survey of the research literature
comparing the visual and auditory presentation of information. USAF
tech. Rep., 1950, No. 5921. Wright-Patterson Air Force Base (Dayton,
Ohio).

Elliott, F. R. Memory for visual, auditory and visual-auditory
material. Arch. Psychol., N.Y., 1936, 29, No. 199.

Erickson, C. I., & King, I. A comparison of visual and oral
presentation of lessons in the case of pupils from the third to the
ninth grade. Sch. Soc., 1917, 6, 146-148.

Evans, J. L., Glaser, R., & Homme, L. E. A preliminary investigation
of variation in the properties of verbal learning sequences of the
"teaching machine" type. In A. A. Lumsdaine & R. Glaser (Eds.),
Teaching machines and programmed learning: A source book. Washington,
D. C.: National Education Association, 1960. Pp. 446-451.

Glaze, J. A. The association value of non-sense syllables. J. genet.
Psychol., 1928, 35, 255-269.

Krawiec, T. S. A comparison of learning and retention of materials
presented visually and auditorily. J. gen. Psychol., 1946, 34,
179-195.

Lumley, F. H. Research in radio education at Ohio State University.
In, Education on the air: Fourth yearbook of the Institute for
Education by Radio. Columbus: Ohio State University, 1933.

Russell, R. D. A comparison of two methods of learning. J. educ.
Res., 1938, 18, 235-239.

Sidowski, J. B., & Eason, R. G. Drive, verbal performance, and muscle
action potential. J. exp. Psychol., 1960, 60, 365-370.

Silverman, R. E., & Alter, M. Note on the response in teaching
machine programs. Psychol. Rep., 1960, 7, 496.


(Received April 5, 1960) 


Journal of Educational Psychology 
1961, Vol. 52, No. 5, 266-274 


THE ROLE OF DISCRIMINABILITY IN MEANING- 
FUL VERBAL LEARNING AND RETENTION 


DAVID P. AUSUBEL AND DONALD FITZGERALD


Bureau of Educational Research, University of Illinois 


In a recent study (Ausubel, 1960), 
it was shown that introductory mate- 
rial at a high level of abstraction, gen- 
erality, and inclusiveness (advance or- 
ganizers) facilitates meaningful verbal 
learning and retention. By deliberately
introducing relevant and appropri- 
ately inclusive subsuming concepts 
into cognitive structure, one provides 
helpful ideational scaffolding which 
enhances the incorporability and lon- 
gevity of the more detailed material in 
the learning passage. 

Advance organizers, however, or- 
dinarily have two distinct functions 
that correspond in turn to two differ- 
ent aspects of the unfamiliarity of 
meaningful learning material. Some- 
times, as in the above mentioned ex- 
periment, the new material is almost 
completely unfamiliar in the sense 
that cognitive structure is barren of 
even generally related concepts. Under 
these circumstances the purpose of the 
organizer is simply to provide idea- 
tional anchorage or scaffolding. More 
typically, however, the new learning 
material (e.g., Buddhist doctrines) is 
a variant of related, previously learned 
concepts (Christian doctrines) already 
established in cognitive structure.
Here the role of the organizer is not
only to provide optimal anchorage at 
an optimal level of inclusiveness, but 
also to increase the discriminability of 
the learning passage from analogous 
and often conflicting ideas in the 
learner’s cognitive structure.

This second role of organizers is predicated on the assumption that
if the distinguishing features of the new learning passage are not
originally salient or readily discriminable from established ideas in
cognitive structure, they can be adequately represented by the latter
for memorial purposes, and hence would not persist as separately
identifiable memories in their own right. It is assumed, in other
words, that only discriminable categorical variants of previously
learned concepts have long-term retention potentialities. Thus, if a
comparative type of organizer could first delineate clearly,
precisely, and explicitly the principal similarities and differences
between the new learning passage (Buddhism) and existing, related
concepts in cognitive structure (Christianity), it seems reasonable to
suppose that the more detailed Buddhist ideas would be grasped later
with fewer ambiguities, fewer competing meanings, and fewer
misconceptions suggested by the learner's knowledge of Christianity;
and that as these clearer, less confused Buddhist meanings interact
with analogous Christianity meanings during the retention interval,
they would be more likely to retain their identity. In this
experiment, therefore, the value of a comparative organizer was tested
by contrasting its effects on the retention of a Buddhism learning
passage with both those of a nonideational (historical) introduction
and those of a simple expository organizer.

It is hypothesized, first, that to the extent that the organizer is
rendered discriminable from related concepts (Christianity)
established in cognitive structure, and hence to the extent that it
increases the discriminability of the new learning passage from these
established concepts, it facilitates the learning and retention of the
new Buddhist ideas. In addition, it is hypothesized, for analogous
reasons, that the discriminability (and hence the learning and
retention) of the Buddhism passage varies as a function of the clarity
and stability of the learner's existing knowledge of Christianity, and
accordingly that subjects with relatively unclear and unstable
concepts of Christianity derive relatively more benefit from the
organizers than do subjects with clear and stable concepts in this
area of knowledge.
These hypotheses were suggested by the findings of a previous
experiment (Ausubel & Blake, 1958) in which the learning and retention
of a comparative Buddhism passage were contrasted with that of an
expository Buddhism passage. However, for the following reasons, the
organizer approach was adopted in this experiment in preference to
manipulating the learning material itself: First, organizers provide
advance ideational scaffolding. Second, they provide the learner with
a generalized overview of all of the major similarities and
differences between the two bodies of concepts before he encounters
the new concepts individually in more detailed and particularized
form. Finally, they create an advance set in the learner to perceive
similarities and differences, and, by avoiding overly explicit
specification, encourage him actively to make his own differentiations
in terms of his own particular sources of confusion.
The hypotheses advanced in this study apply only to meaningful
learning material which, although unfamiliar to subjects, is relatable
to long-established and relatively stable concepts in cognitive
structure. They do not apply to [rote learning], to completely
unfamiliar learning material, or to learning tasks that can only be
related to unstable or recently learned concepts. The organizers,
furthermore, must consist of ideational material (both similarities
and differences) at a high level of abstraction, generality, and
inclusiveness, rather than constitute a simple summary or a mere
listing of specific comparative points.
METHOD


Subjects 


The experimental population consisted of predominantly senior
undergraduate students (94 women and 61 men) in six sections of an
educational psychology course at the University of Illinois. All
subjects were enrolled in one of 10 teacher education curricula at the
secondary school level. The experiment was conducted separately in
each section as a required laboratory exercise, and was performed
during regularly scheduled class hours. In order to maximize ego
involvement, subjects were informed that after the data were processed
their individual scores, as well as the class results, would be
reported to them.


Learning Passages, Organizers, and Measuring Instruments


The learning material used in this study was a specially prepared
2,500-word passage dealing with Buddhist concepts of God, immortality,
soul, faith, salvation, morality, and responsibility. These concepts
were elaborated in considerable detail. The passage only presented the
significant ideas of Buddhism and ignored material on the life of
Buddha, the history and geographical distribution of Buddhism, schools
of Buddhism, and Buddhist ritual. Flesch analysis of the passage
yielded an abstractness score of 19.19 (highly abstract) and a
readability score of 42.46 (difficult).

The topic of Buddhism was chosen both because it was explicitly
unfamiliar to undergraduate students, and because it dealt with
variants of previously learned concepts (i.e., Christian doctrines)
generally familiar to all of our subjects and, presumably, reasonably
well-established in the cognitive structure of most of them. Both
criteria were important since the main purpose of the study was to
ascertain whether advance organizers could facilitate the learning and
retention of unfamiliar meaningful material by increasing the
discriminability between the new material and related concepts already
established in cognitive structure. The use of unfamiliar learning
material also made it possible for all subjects to start from
approximately the same base line in learning the passage. Empirical
confirmation of the unfamiliarity of the Buddhism material was
obtained when a group of comparable subjects, who had not studied the
learning passage, made scores which, on the average, were only
slightly and not significantly better than chance.

Still another advantage of the Buddhism 
material inhered in the fact that its learn- 
ability was relatively uninfluenced in our 
population by such factors as sex, field of 
specialization, and variability in relevant incidental experience. It
is true that women subjects made significantly higher scores than men
subjects on both the Buddhism and Christianity tests. This
superiority, however, was not a function of differential background,
interest, or motivation related to sex per se, but reflected
significantly superior academic aptitude as measured by verbal score
on the School and College Ability Test (SCAT).

Knowledge of the Buddhism material was tested 3 and 10 days after
learning, by equivalent forms of a 45-item multiple-choice test with
corrected split-half reliabilities of .80 and .79, respectively. The
correlation of the 3- and 10-day retention scores was .79. This latter
figure indicated both relatively high stability over time, in view of
the different (3- versus 10-day) retention abilities involved, as well
as a high degree of relationship between the two retention abilities.
Test questions covered principles, facts, and applications, and were
selected by an item analysis procedure from a larger population of
items. Scores on both tests showed a satisfactory range of variability
and were distributed normally. Since they were intended as power
tests, no time limit was imposed.
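The article reports "corrected" split-half reliabilities without naming the correction. A minimal sketch follows, assuming the conventional Spearman-Brown step-up formula was used (an assumption, not something the article states). The half-test correlations computed here are back-calculated for illustration only and are not reported in the source; the function names are hypothetical.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Step up a part-test correlation to the reliability of a test
    `length_factor` times as long (2.0 for a split-half correction)."""
    return (length_factor * r_half) / (1.0 + (length_factor - 1.0) * r_half)


def implied_half_test_r(r_full):
    """Invert the split-half correction: the correlation between halves
    that would yield a corrected reliability of `r_full`."""
    return r_full / (2.0 - r_full)


if __name__ == "__main__":
    for corrected in (0.80, 0.79):  # reliabilities of the two test forms
        r_half = implied_half_test_r(corrected)
        print(f"corrected = {corrected:.2f}  implied half-test r = {r_half:.3f}")
```

If this correction was the one applied, a corrected reliability of .80 corresponds to a correlation of about .67 between the two halves of the 45-item test.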

Three types of introductory passages were used in this experiment,
each about 500 words in length. The comparative organizer pointed out
explicitly the principal similarities and differences between Buddhist
and Christian doctrines. This comparison was presented at a much
higher level of abstraction, generality, and inclusiveness than the
Buddhism passage itself, and was deliberately designed to increase
discriminability between the two sets of concepts. The expository
organizer, on the other hand, merely presented the principal Buddhist
doctrines at a high level of abstraction, generality, and
inclusiveness without making any reference whatsoever to Christianity.
It was not explicitly designed to increase discriminability between
the two religions, but merely to provide general ideational
scaffolding for the detailed Buddhist material. The historical
introduction was intended solely as a control treatment. It contained
interesting historical and human interest material about Buddha and
Buddhism, but neither provided any ideational scaffolding nor
attempted to compare Buddhism and Christianity. No information was
included in any of the introductory passages that could constitute a
direct advantage in answering questions on the Buddhism test.

It was methodologically important to provide an historical
introduction for the control group in order that any obtained
differences in retention outcomes between experimental (comparative)
and control (historical) groups could be attributed to the particular
nature of the comparative organizer (i.e., to its enhancing effects on
discriminability) rather than to its presence per se. The purpose of
exposing another group of subjects to an expository introduction was
to determine whether the comparative organizer could increase the
discriminability of the learning passage over and above that which
could be attributed to the use of a simple organizer (i.e., to the
mere provision of advance ideational scaffolding).

A 36-item multiple-choice test on Christianity was used to measure
variability among our subjects with respect to the stability and
clarity of those existing (Christianity) concepts within cognitive
structure which were analogous to the Buddhism learning passage and
hence potentially interfering. Test items were noncontroversial,
dealing with Old and New Testaments, church history, and
denominational beliefs and differences. The scores on this test were
normally distributed and the corrected split-half reliability was
[illegible].

Scores on the verbal portion of the SCAT were available for 65
subjects. This test had been previously administered as part of a
battery of entrance examinations for incoming university freshmen.


Procedure 


[Treatment-group membership] was determined by random assignment. The
population of each of the three treatment groups was also stratified
by sex so that the proportion of men to women subjects would be the
same in each group. This was necessary because of the women's
significantly higher verbal SCAT scores. It was possible to administer
all three treatments simultaneously because they consisted of
identical-appearing introductory passages (with identical sets of
directions), differing only in content. To control for the possible
effects of prior exposure to the Buddhism material, those few subjects
(5% of the total) who had [had courses] in comparative religion were
equally distributed, in random fashion, among the three treatment
groups. It had also been assumed that random assignment of subjects
would render the different treatment groups equivalent with respect to
such factors as learning ability and knowledge of Christianity. This
assumption was confirmed empirically by the finding that differences
between these groups on SCAT and Christianity test scores were
negligible.

In order to control for the effects of different instructor,
situational, and classroom climate variables in the six sections,
students within each section were equally divided among the three
treatment groups. Since analysis of the data showed that homogeneity
of variance prevailed, both on an intersectional as well as on an
intergroup basis, for both the Christianity test scores and the two
sets of Buddhism test scores, it was considered justifiable to treat
the scores of the three treatment groups on each of these instruments
as comparable random samples from the same population.

[Illegible] days after studying their designated introductions, all
subjects read and studied the same Buddhism passage for [illegible].
One form of the Buddhism test was administered to all subjects 3 days
later, and an equivalent form of the same test was administered one
week after the first test, or 10 days after the learning session. A
special group of randomly assigned subjects was constituted from each
of the six sections along the same lines described above for the three
treatment groups. The procedure followed with this special group was
identical with that used for the comparative group except that
subjects studied a 1,800-word passage on the endocrinology of human
pubescence instead of the Buddhism passage. The purpose of this
procedure was to ascertain to what extent knowledge of the comparative
organizer (without any exposure to the Buddhism learning passage
itself) could increase scores on the Buddhism tests beyond chance
expectancy.


RESULTS AND DISCUSSION


Comparison of corresponding 3- 
and 10-day means of total treatment 
groups shows that retention loss dur- 
ing this interval was relatively slight. 
The loss was greatest in the historical 
group (p = .05), least in the exposi- 
tory group, and intermediate in the 
comparative group (p < .10). This
low degree of retention loss is prob- 
ably attributable both to the nega- 
tively accelerated shape of most re- 
tention curves, particularly in the 
case of meaningful material, and to 
the “rehearsal effect” of the 3-day 
Buddhism test on the subsequent test 
of retention. The retention scores of 
our subjects over the two intervals 
were highly correlated (r = .79). 
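The ordering of retention loss described above can be read directly from the total-group means in Table 1. The following sketch simply subtracts each group's 10-day mean from its 3-day mean; the dictionary and function names are illustrative, and the means are transcribed from the table's "Total" rows.

```python
# Total-group mean retention scores from Table 1 (45-item Buddhism test).
TOTAL_MEANS = {
    "comparative": {"day_3": 21.83, "day_10": 20.31},
    "expository": {"day_3": 19.91, "day_10": 19.65},
    "historical": {"day_3": 19.97, "day_10": 17.63},
}


def retention_loss(means):
    """Return each group's drop in mean score from the 3-day to the 10-day test."""
    return {
        group: round(scores["day_3"] - scores["day_10"], 2)
        for group, scores in means.items()
    }


def groups_by_loss(means):
    """List the groups from largest to smallest retention loss."""
    losses = retention_loss(means)
    return sorted(losses, key=losses.get, reverse=True)


if __name__ == "__main__":
    for group, loss in retention_loss(TOTAL_MEANS).items():
        print(f"{group:12s} loss = {loss:.2f} items")
    print("largest to smallest:", groups_by_loss(TOTAL_MEANS))
```

The differences are small relative to the 45-item scale, and their ordering (historical, then comparative, then expository) matches the description in the text.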


Effect of Organizers on Retention 


On a 3-day basis, only the com- 
parative organizer was effective in 
facilitating retention of the Buddhism 
material. Table 1 shows that the mean 
retention score of the total compara- 
tive group was significantly superior 
to both that of the historical group 
(p = .05) and that of the expository 
group (p = .05). However, practi-
cally all of this obtained difference
between the comparative group, on 
the one hand, and the expository and 
historical groups, on the other, was 
derived from the below-median sub- 
groups on the Christianity test. 
Within these below-median sub-
groups the differences between the
means in question were much greater 
than the corresponding differences be- 
tween total treatment groups, and 
their level of significance was also 
correspondingly higher (p = .02, p = 
.005).
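The claim that the comparative organizer's 3-day advantage came almost entirely from the below-median subgroups can be illustrated with the means in Table 1. In this sketch the above-median historical mean is read as 23.42, the value consistent with that group's total of 19.97; the data structure and function name are illustrative only.

```python
# 3-day mean retention scores from Table 1, by Christianity-test subgroup.
THREE_DAY_MEANS = {
    "above_median": {"comparative": 23.50, "expository": 22.50, "historical": 23.42},
    "below_median": {"comparative": 20.50, "expository": 17.32, "historical": 16.52},
    "total": {"comparative": 21.83, "expository": 19.91, "historical": 19.97},
}


def comparative_advantage(means, other_group):
    """Difference (comparative minus `other_group`) at each subgroup level."""
    return {
        level: round(row["comparative"] - row[other_group], 2)
        for level, row in means.items()
    }


if __name__ == "__main__":
    for other in ("historical", "expository"):
        print(f"comparative minus {other}:")
        for level, diff in comparative_advantage(THREE_DAY_MEANS, other).items():
            print(f"  {level:13s} {diff:+.2f}")
```

Against the historical group, for example, the difference is about 4 items in the below-median subgroup but under a tenth of an item in the above-median subgroup. This sketch compares means only and does not reproduce the significance tests reported in the article.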

It is apparent, therefore, that although provision of ideational
scaffolding in the form of an expository organizer did not enhance
retention of the Buddhism passage over a 3-day interval, the combined
scaffolding and explicit discriminability effects induced by the
comparative organizer did significantly improve retention. The
short-term retention loss in the control group was evidently small
enough to preclude the possibility of a significant difference in
retention attributable to the facilitating effects of a simple
expository organizer. It is true that an expository type of organizer
significantly increased 3-day retention in an earlier experiment
(Ausubel, 1960); but the learning passage used then was more
unfamiliar to subjects, and the latter also had the benefit of
studying the organizer on two separate occasions.

As hypothesized, subjects with relatively superior knowledge of
Christianity derived considerably less benefit from the comparative
organizer than did subjects whose knowledge of Christianity was less
impressive. This finding was consistent with the self-evident
proposition that if discriminability of a learning passage is already
high because of endogenous factors within cognitive structure (i.e.,
because of the clarity and stability of related established
knowledge), less scope exists for the potentially facilitating
influence of exogenously manipulated factors (i.e., organizers)
designed to promote discriminability.
On a 10-day basis, both the comparative and expository total groups
were significantly superior to the historical total group in retaining
the Buddhism material (p = .02, p = .05), but the difference between
the comparative and expository groups was negligible (see Table 1). As
was true of the 3-day scores, however, most of the difference between
organizer and control groups was derived from the below-median
subgroups on the Christianity test. When the means of just these
below-median subgroups were compared, the significance level of the
comparative-historical and the expository-historical differences was
enhanced (p = [illegible], p < .05), and the difference between the
comparative and expository groups was significant at the [illegible]
level.

In comparing the 10- to the 3-day retention data, it appears that only
over the longer time interval was the natural retention loss
sufficiently great to provide scope for the facilitating influence of
the scaffolding effects available from the expository organizer.
Second, although the comparative organizer was not significantly more
effective than the expository organizer over the longer interval when
the results of all subjects in these groups were considered, there was
a suggestive trend in this direction among the below-median subjects.
One explanation of the relatively small difference in 10-day retention
attributable to the influence of explicit comparison per se, is the
possibility that confronting the learner in advance with the major
principles of Buddhism in a detail-free [form] (the expository
organizer) implicitly increased the discriminability of the Buddhism
ideas by enabling him to make his own comparisons with Christianity.
Another plausible explanation is that by enhancing retention
generally, the rehearsal effect induced by the 3-day Buddhism test had
a leveling influence on the relative degree of facilitation that might
have been expected from the two kinds of organizers. This
interpretation is supported by the erosion of the significant
retention difference between the comparative and expository groups
from the third to the tenth day.

The same tendency noted above in the 3-day scores, for only the
below-median as opposed to the above-median subgroups on the
Christianity test to derive appreciable benefit from the organizers,
also appeared in the 10-day retention scores. Thus, in the learning
and retention of unfamiliar ideational material that is relatable to
established concepts in the learner's cognitive structure, both
comparative and expository organizers appear to be effective only in
those instances where existing (endogenously determined)
discriminability between the two sets of ideas is inadequate as a
consequence of the instability or ambiguity of the established
concepts.

TABLE 1
MEAN RETENTION TEST SCORES OF EXPERIMENTAL AND CONTROL GROUPS ON
BUDDHISM PASSAGE

                                   Treatment group
Knowledge of          Comparative    Expository     Control
Christianity           organizer     organizer    (historical)
3-day retention:
  Above median           23.50         22.50         23.42
  Below median           20.50         17.32         16.52
  Total                  21.83         19.91         19.97
10-day retention:
  Above median           21.79         22.97         20.87
  Below median           19.21         17.02         14.40
  Total                  20.31         19.65         17.63

Note. Chance score on the multiple-choice test of 45 items is
[illegible].
The special control group which only studied the comparative
organizer (without any exposure to the Buddhism passage itself) made a
mean score of [1]3.20 on the 3-day Buddhism test and a mean score of
13.45 on the 10-day test. These scores were significantly greater than
the scores of a comparable naive group (which took the Buddhism tests
without being exposed to either organizer or learning passage), but
were substantially below those of the historical and two organizer
groups.

It is quite unlikely, however, that 
the organizers per se directly furnished 
pertinent information enabling sub- 
jects to obtain higher scores on the 
Buddhism tests. In the first place, a 
deliberate effort was made to avoid 
providing such information in the 
two organizers. Second, if the organ- 
izers themselves had furnished useful 
information in answering test items, 
it would be difficult to explain why 
subjects in the expository group did 
not make higher scores than subjects 
in the historical group on the 3-day 
Buddhism test. Much more credible, 
therefore, is the explanation that ex- 
posure to the comparative organizer 
merely increased the general sophistication
of subjects in the special control
group about Buddhist concepts,
and thereby helped them to eliminate 
misleads in the multiple-choice test 
questions. Thus, they were able to ob- 
tain better than chance scores without 
studying the learning passage itself. 
But when the learning passage was 
available (i.e., in the historical and
two organizer groups), subjects were 
neither benefited by the general so- 
phistication they obtained from an 
ideational organizer, nor handicapped 
by not possessing such sophistication. 


Effect of Knowledge of Christianity 
on Retention 

The data clearly support the hypothesis that the discriminability of
the Buddhism learning material varies as a function of the clarity and
stability of the established concepts to which it is related (i.e.,
Christianity), and hence that Buddhism retention scores are positively
correlated with knowledge of Christianity. Table 1 shows that within
each treatment group the mean retention score of the above-median
group was significantly greater than the mean retention score of the
below-median group. For the historical, expository, and comparative
groups, these differences were significant at the .0005, .001, and .02
levels, respectively, on the 3-day Buddhism test, and at the .0005,
.0005, and .10 levels, respectively, on the 10-day Buddhism test. The
same trends are shown by the positive correlations between scores on
the Buddhism and Christianity tests (see Table 2).

TABLE 2
CORRELATIONS OF BUDDHISM RETENTION SCORES AND OTHER MEASURES

                                  Buddhism retention scores
                          Comparative      Expository      Historical
                             group           group           group
Other measures            3-day  10-day   3-day  10-day   3-day  10-day
Christianity scores        .37*   .01      .57**  .42**    .55**  .56**
Verbal ability scores      .62**  .60**    .75**  .79**    .58**  .52**
Christianity scores with
  verbal ability
  eliminated               .06   -.07      .40**  .14      .40**  .43**

 * Significant at .05 level.
** Significant at .01 level.

It is evident, therefore, that the 
clearer and more stable subjects’ 
knowledge of Christianity was, the
more discriminable this knowledge 
was from Buddhism concepts, and 
hence the higher the Buddhism reten- 
tion scores were. And conversely, the 
less clear and more unstable subjects’ 
knowledge of Christianity was, the 
less discriminable it was from Bud-
dhism concepts, and the lower the
Buddhism retention scores were. 

The data also confirm the hypothesis that organizers (by virtue of
their leveling effect on the endogenous discriminability advantage
inherent in clear and stable knowledge of Christianity) reduce the
relationship between Christianity knowledge and Buddhism retention
scores in proportion to their facilitating effect on retention. The
leveling effect of the organizers (particularly the comparative) can
be seen in Table 1 by comparing the relative magnitude of the
differences between above-median and below-median subgroups in the
various treatment groups. For the historical, expository, and
comparative groups the respective t values of these differences were
4.86, 3.43, and [illegible] for the 3-day Buddhism scores, and 4.72,
3.99, and 1.59 for the 10-day Buddhism scores. Correlations between
the Buddhism and Christianity scores show the same trend (see Table
2). The correlation between Christianity and Buddhism scores was
higher in the historical than in the comparative group for both the
3-day (p = .12) and 10-day (p [illegible]) retention intervals. The
almost identical correlations, on the other hand, between Christianity
and 3-day Buddhism scores in the expository and historical groups,
reflected the ineffectiveness of the expository organizer in
facilitating retention over the 3-day interval.

Because of the positive correlation of .44 (p < .01) between
Christianity and SCAT scores, however, it was necessary to check the
alternative hypothesis that the significantly higher retention scores
of subjects in the above-median groups reflect superior verbal ability
rather than superior knowledge of Christianity per se (endogenous
discriminability). But since the availability of only 65 SCAT scores
made it impossible to match for verbal ability those subjects within
each treatment group who were above and below the median score on the
Christianity test, it was necessary to resort to partial correlation.

The partial correlations between the Buddhism and Christianity scores,
with the effect of verbal ability eliminated, are shown for all three
groups in Table 2. For the historical group the partial correlations of
.40 and .43 (for the 3- and 10-day scores, respectively), although lower
than the corresponding simple correlations of .[digit illegible]5 and
.56, were still significant at the .01 level. Three of the four other
partial correlations, however, were negligible and nonsignificant,
indicating that the corresponding simple correlations largely reflected
the positive relationship between verbal ability and Buddhism retention
scores. The intrinsic residual relationship between knowledge of
Christianity and retention of the Buddhism material, after the effect of
verbal ability was eliminated, was actually close to zero in these
instances, because of the leveling influence of the organizers on the
endogenous discriminability advantage conferred by superior knowledge of
Christianity. The partial correlation between the Christianity and 3-day
Buddhism scores remained significant at the .01 level in the expository
group, inasmuch as the expository organizer did not facilitate retention
over the 3-day interval.
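
The partial correlations discussed above can be illustrated with the standard first-order formula, r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz^2)(1 - r_yz^2)). The following is a minimal sketch in Python, assuming x is the Christianity score, y the Buddhism retention score, and z the SCAT (verbal ability) score. The function name is hypothetical, and the value used for r_yz is an assumption chosen only for illustration; it is not a figure reported in the study, so the output should not be read as a re-derivation of Table 2.

```python
import math


def partial_corr(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation of x and y with z held constant."""
    denom = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denom == 0.0:
        raise ValueError("undefined when x or y is perfectly correlated with z")
    return (r_xy - r_xz * r_yz) / denom


# r_xy = .56 (historical group, 10-day scores) and r_xz = .44
# (Christianity and SCAT) appear in the text; r_yz = .50 is a
# hypothetical value assumed here purely for illustration.
illustration = partial_corr(r_xy=0.56, r_xz=0.44, r_yz=0.50)
print(f"illustrative partial correlation: {illustration:.2f}")
```

The formula shows why a simple correlation can shrink toward zero once verbal ability is partialed out: the larger the product r_xz * r_yz, the more of r_xy is attributable to the shared third variable.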

Intergroup comparison of the partial correlations in Table 2 shows even
more clearly than does corresponding comparison of the simple
correlations between Buddhism and Christianity scores, that the
organizers (in proportion to their facilitating effect on retention)
reduce the significant relationship between knowledge of Christianity
and retention of the Buddhism material. With the influence of verbal
ability eliminated, the partial correlation between Christianity and
3-day Buddhism scores in both the historical and expository groups was
significantly higher than in the comparative group (p = .05, p = .05).
The partial correlation between Christianity and 10-day Buddhism scores
in the historical group was higher than the corresponding partial
correlations in the expository (p = .08) and comparative (p = .01)
groups. In effect then, by being provided with a given type of
discriminability aid (organizer), subjects possessing relatively little
knowledge of Christianity (and hence little endogenous discriminability)
were placed on approximately the same footing with respect to the
discriminability variable and its effect on retention, as subjects
possessing greater knowledge of Christianity (and greater endogenous
discriminability).
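
The article does not say which test produced the p values for these intergroup comparisons. One common way to compare correlations from two independent groups is Fisher's r-to-z transformation, sketched below in Python under that assumption. The function name and the group sizes in the example are hypothetical; only the coefficient .43 is taken from the text.

```python
import math


def compare_independent_correlations(r1: float, n1: int, r2: float, n2: int):
    """Fisher r-to-z test for two correlations from independent samples.

    Returns the z statistic and the one-tailed p value for r1 > r2.
    """
    if n1 <= 3 or n2 <= 3:
        raise ValueError("each group needs more than 3 subjects")
    z1, z2 = math.atanh(r1), math.atanh(r2)          # Fisher transformation
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))  # standard error of z1 - z2
    z = (z1 - z2) / se
    p_one_tailed = 0.5 * math.erfc(z / math.sqrt(2.0))  # P(Z > z)
    return z, p_one_tailed


# .43 is the historical group's 10-day partial correlation from the text;
# the second coefficient and both group sizes are assumed for illustration.
z, p = compare_independent_correlations(0.43, 25, 0.0, 25)
print(f"z = {z:.2f}, one-tailed p = {p:.3f}")
```

With small groups the standard error is large, which is why even sizable differences between coefficients can yield only suggestive p values.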

Table 2 also shows that the corre- 
lation between verbal ability and 
Buddhism retention scores in the his- 
torical group (where the relationship 
was not influenced by interaction with 
organizer effects) was approximately 
the same as that between Christianity 
and Buddhism scores. But unlike the 
latter relationship, which was at- 
tenuated by interaction with the or- 
ganizers, the correlation between ver- 
bal ability and Buddhism scores was 
slightly higher in the comparative 
than in the historical group, and sug- 
gestively but not significantly higher 
in the expository than in the historical 
group (p = .17, p = .07). At the very 
least, therefore, it is definite that 
neither organizer detracted from the 
strong positive relationship between 
verbal ability and the retention of the 
Buddhism material; and there was a 
suggestive tendency for verbal ability 
to have an even greater impact on the 
retention of the Buddhism passage
when subjects were given the benefit
of an expository organizer. 


SUMMARY AND CONCLUSIONS


This experiment was concerned 
with the role of endogenous and ex- 
ternally manipulated discriminability 
in the learning and retention of un- 
familiar ideational material (Bud- 
dhism) that was relatable to previ- 
ously learned concepts (Christianity) 
already established in cognitive structure.

The learning task consisted of a 
2,500-word passage on the principles 
of Buddhism. Two days before study- 
ing this learning passage, one experi- 
mental group studied a 500-word 
comparative organizer explicitly com- 
paring the major ideas of Buddhism 
and Christianity. Another experimen- 
tal group studied an expository or- 
ganizer which made no reference to 
Christianity. A control group studied 
an historical introduction dealing 
with the history rather than with the 
ideas of Buddhism. Retention of the 
Buddhism material was tested 3 and 
10 days after the learning session by 
means of equivalent forms of a multi- 
ple-choice test. For purposes of anal- 
ysis, subjects in the various treatment 
groups (university undergraduates) 
were divided into above- and below- 
median subgroups in terms of their 
scores on an objective test of Christi- 
anity. 

Subjects with greater knowledge 
of Christianity made significantly 
higher scores on the Buddhism reten- 
tion test than did subjects with less 
knowledge of Christianity. This significantly positive relationship
between Christianity and Buddhism test scores held up even when the
effect of verbal ability was eliminated. Hence the data support the
hypothesis that the learning and retention of unfamiliar verbal material
varies positively with its discriminability from related, previously
learned concepts established in cognitive structure, and that this
endogenously determined discriminability is a function of the clarity
and stability of the latter concepts.

On a 3-day basis only the compara- 
tive organizer was significantly effec- 
tive in facilitating the retention of the 
Buddhism material, but over a 10-day 
interval both comparative and exposi- 
tory organizers were significantly ef- 
fective. These facilitating effects of 
the organizers on retention outcomes, 
however, only applied to subjects in 
the below-median subgroups on the 
Christianity test. Thus in the learning 
and retention of unfamiliar ideational 
material that is relatable to estab- 
lished concepts in the learner’s cogni- 
tive structure, both comparative and 
expository organizers appear to be 
effective only in those instances where 
existing discriminability between the 
two sets of ideas is inadequate as a 
consequence of the instability or am- 
biguity of established concepts. 


REFERENCES 


Ausubel, D. P. The use of advance organizers in the learning and
retention of meaningful verbal material. J. educ. Psychol., 1960, 51,
267-272.

Ausubel, D. P., & Blake, E., Jr. Proactive inhibition in the forgetting
of meaningful school material. J. educ. Res., 1958, 52, 145-149.


(Received May 8, 1961) 


Journal of Educational Psychology
1961, Vol. 52, No. 6, 275-281


JOURNAL OF 
EDUCATIONAL PSYCHOLOGY 


Volume 52 


December 1961 


Number 6 


THE GENETIC APPROACH TO THE PSYCHOLOGY 
OF THOUGHT¹


JEAN PIAGET 
University of Geneva, Switzerland 


¹ This paper, transmitted by Ernest [surname illegible], is the text of
an address made by Piaget, from notes, at the New York Academy of
Sciences' Conference on the Psychology of Thinking on April 28-29, 1960.
The translation was made by Ruth Solbin, who has skillfully succeeded in
retaining Piaget's individualistic style of expression.

From a developmental point of view, the essential in the act of thinking
is not contemplation (that is to say, that which the Greeks called
[term illegible]) but the action of the dynamics.

Taking into consideration all that is known, one can distinguish two
principal aspects:

1. The formal viewpoint, which deals with the configuration of the state
of things to know: for instance, most perceptions, mental images,
imageries.

2. The dynamic aspect, which deals with transformations: for instance,
to disconnect a motor in order to understand its functioning, to
disassociate and vary the components of a physical phenomenon, to
understand its causalities, to isolate the elements of a geometrical
figure in order to investigate its properties, etc.

The study of the development of thought shows that the dynamic aspect is
at the same time more difficult to attain and more important, because
only transformations make us understand the state of things. For
instance: when a child of 4 to 6 years
transfers a liquid from a large and 
low glass into a narrow and higher 
glass, he believes in general that the 
quantity of the liquid has increased, 
because he is limited to comparing the 
initial state (low level) to the final 
state (high level) without concerning 
himself with the transformation. To- 
ward 7 or 8 years of age, on the other 
hand, a child discovers the preserva- 
tion of the liquid, because he will 
think in terms of transformation. He 
will say that nothing has been taken 
away and nothing added, and, if the 
level of the liquid rises, this is due to 
a loss of width, etc. 

The formal aspect of thought makes 
way, therefore, more and more in the 
course of the development to its dy- 
namic aspect, until such time when 
only transformation gives an under- 
standing of things. To think means, 
above all, to understand; and to un- 
derstand means to arrive at the trans- 
formations, which furnish the reason 
for the state of things. All development of thought can be summed up in
the following manner: a construction of operations which stem from
actions and a gradual subordination of formal aspects to dynamic
aspects.

The operation, properly speaking, 
which constitutes the terminal point 
of this evolution is, therefore, to be
conceived as an internalized, reversible action (example: addition and
subtraction, etc.) bound to other op- 
erations, which form with it a struc- 
tured whole and which is character- 
ized by well defined laws of totality 
(example: the groups, the lattice, 
etc.). Dynamic totalities are clearly 
different from the “gestalt” because 
those are characterized by their non- 
additive composition, consequently 
irreversible. 

So defined, the dynamics intervene 
in the construction of all thought 
processes; in the structure of forms 
and classifications, of relations and 
serialization of correspondences, of 
numbers, of space and time, of the 
causality, etc. One could think at first
glance that space and geometry add 
to the formal aspect of thought. In 
this way one conceived of the geo- 
metric science in the past, considering 
it impure mathematics, but applica- 
ble to perception and intuition. Mod- 
ern geometry, since Le Programme 
d'Erlangen by F. Klein, has tended,
like all other precise disciplines, to 
subordinate the formal to the dy- 
namic. The geometries are, indeed, 
understood today as relying all on 
groups of transformation, so that one 
can go from one to the other by char- 
acterizing one less general “subgroup” 
as part of a more inclusive group. 
Thus geometry too rests on a system 
of dynamics. 

Any action of thought consists of 
combining thought operations and in- 
tegrating the objects to be understood 
into systems of dynamic transformation. The psychological criterion of
this is the appearance of the notion of conservation or "invariants of
groups." Before speech, at the purely sensory-motor stage of a child
from 0 to 18 months, it is possible to observe actions which show
evidence of such tendencies. For instance: From 4-5 to
18 months, the baby constructs his
first invariant, which is the schema 
of the permanent object (to recover 
an object which escaped from the field 
of perception). He succeeds in this by 
coordinating the positions and the dis- 
placements according to a structure, 
which can be compared to what the 
geometricians call “group displace- 
ments.” 

When, with the beginning of the 
symbolic function (language, sym- 
bolic play, imagerie, etc.), the repre- 
sentation through thought becomes 
possible, it is at first a question of 
reconstructing in thought what the 
action is already able to realize. The 
actions actually do not become trans- 
formed immediately into operations, 
and one has to wait until about 7 to 8 years for the child to reach a
functioning level. During this preoperative period the child, therefore,
only arrives at incomplete structures characterized by a lack in the
notion of combinations and, consequently, by a lack of logic (in
transitivity, etc.).

In the realm of causality one can 
especially observe these diverse forms 
of precausality, which we have pre- 
viously described in detail. It is true 
that a certain number of authors— 
Anglo-Saxon above all—have se- 
verely criticized these conclusions, 
while others have recognized the same 
facts as we have (animism, etc.). Yet, 
in an important recent book (which
will appear soon) two Canadian au- 
thors, M. Laurendeau and A. Pinard, 
have taken the whole problem up 
once again by means of thorough sta- 
tistics. In the main points they have 
come to a remarkable verification of 
our views, explaining, moreover, the 
methodological reasons for the diver- 
gencies among the preceding authors. 

At about 7 to 8 years the child arrives at his first complete dynamic
structures (classes, relations, and numbers), which, however, still
remain concrete; in other words, only at the time of a handling of
objects (material manipulation or, when possible, directly imagined). It
is not before the age of 11 to 12 years or more that operations can be
applied to pure hypotheses. At this latter level, a logic of
propositions helps complete the concrete structures. This enlarges the
structures considerably until their disposition.

The fundamental genetic problem of the psychology of thought is hence to
explain the formation of these dynamic structures. [Word illegible] one
would have to rely on three principal factors in order to explain the
facts of development: maturation, physical experience, and social
interaction. But in this particular case none of these three suffice to
furnish us with the desired explanations, not even the three together.
Maturation. First of all, none of these dynamic structures are innate;
they form very gradually. (For example: The transitivity of equalities
is acquired at approximately 6½ years, and the ability of linear measure
comes about only at 9 years, as does the full understanding of weights,
etc.) But progressive construction does not seem to depend on maturation
because the [text illegible] particular age. [Text illegible] is
con[remainder illegible] one witnesses innumerable accelerations or
retardations [text illegible] acquired experience. Certainly one
[text illegible] the inevitable [text illegible] which maturation plays,
but it is determined [text illegible] existing possibility (or
[text illegible]) [text illegible] still remain to be [text illegible]
brings about other factors. In [text illegible] the domain of thought,
the factors of innateness seem above all limitative. We do not have, for
example, an intuition of space in the fourth dimension; nevertheless we
can deduce it.

Physical Experience. Experiencing 
of objects plays, naturally, a very im- 
portant role in the establishment of 
dynamic structures, because the op- 
erations originate from actions and 
the actions bear upon the object. This 
role manifests itself right from the 
beginning of sensory-motor explora- 
tions, preceding language, and it af- 
firms itself continually in the course 
of manipulations and activities which 
are appropriate to the antecedent 
stages. Necessary as the role of ex- 
perience may be, it does not suffi- 
ciently describe the construction of 
the dynamic structures—and this for 
the following three reasons. 

First, there exist ideas which can- 
not possibly be derived from the 
child’s experience—for instance, when 
one changes the shape of a small ball 
of clay. The child will declare, at 7 to 
8 years, that the quantity of the mat- 
ter is conserved. It does so before dis- 
covering the conservation of weight 
(9 to 10 years) and that of volume 
(10 to 11 years). What is the quantity 
of a matter independently of its 
weight and its volume? This is an ab- 
stract notion corresponding to the 
“substance” of the pre-Socratic physicists. This notion can be neither
perceived nor measured. It is, therefore, the product of a dynamic
deduction and not part of an experi- 
ence. (The problem would not be 
solved either by presenting the quan- 
tity in the form of a bar of chocolate 
to be eaten.) 

Secondly, the various investigations into the learning of logical
structures,² which we were able to make at our International Center of
Genetic Epistemology, lead to a very unanimous


278 


result: one does not “learn” a logical 
structure as one learns to discover 
any physical law. For instance, it is 
easy to bring about the learning of 
the conservation of weight because of 
its physical character, but it is difficult to obtain the one of the
transitivity of the relationship of the weight:

A = C if A = B and B = C

or the one of the relationship of inclusion, etc. The reason for this is
that
in order to arrive at the learning of a 
logical structure, one has to build on 
another more elementary logical (or 
prelogical) structure. And such struc- 
tures consequently never stem from 
experience alone, but suppose always 
a coordinating activity of the subject. 

Thirdly, there exist two types of ex- 
periences: 

1. The physical experiences show 
the objects as they are, and the 
knowledge of them leads to the ab- 
straction directly from the object (ex- 
ample: to discover that a more volu- 
minous matter is more or less heavy 
than a less voluminous matter), 

2. The logicomathematical experi- 
ence supposes to interrelate by action 
individual facts into the world of ob- 
jects, but this refers to the result of 
these actions rather than to the ob- 
jects themselves. These interrelations 
are arrived at by process of abstrac- 
tions from the actions and their co- 
ordinates. For instance, to discover 
that 10 stones in a line always add up 
to 10, whether they are counted from 
left to right or from right to left. Because then the order and the
total sum have been presented. The new knowledge consists simply in the
discovery that the action of adding a sum is independent of the action
of putting them in order. Thus the logicomathematical experience does
not stem from the same type of learning as that of the physical
experience, but rather from an equilibration of the schemes of actions,
as we will see.

² See Etudes d'Epistemologie Genetique, Vol. 7 and 10.

Social Interaction. The [word illegible] and social transmission
([word illegible], etc.) plays, naturally, an [word illegible] role in
the formation of dynamic structures, but this factor does not suffice
either to entirely explain its development, and this for two reasons:

First, a certain number of structures do not lend themselves to teaching
and are prior to all teaching. One can cite, as an example, most
[word illegible] of conservation, of which, in general, the pedagogs
agree that they are not problematic to the child.

The second, more fundamental, reason is that in order to understand the
adult and his language, the child needs means of assimilation which are
formed through structures preliminary to the social transmission. For
instance, an ancient experience has shown us that French-speaking
children understand very early the expression "quelques unes de mes
fleurs" [some of my flowers] in contrast to "toutes mes fleurs" [all my
flowers], and this occurs when they have not yet constructed the
relation of inclusion:

Some A are part of all B; therefore A < B


In conclusion, it is not exaggerated to maintain that the basic factors
invoked before in order to explain mental development do not suffice to
explain the formation of the dynamic structures. Though all three of
them certainly play a necessary role, they do not constitute in
themselves sufficient reason and one has to add to them a fourth factor,
which we shall try to describe now.


This fourth factor seems to us to consist of a general progression of
equilibration. This factor intervenes, as is to be expected, in the
interaction of the [text illegible] It is self-evident that these
internal and external factors equilibrate each other. The question is
then to know if we are dealing here only with momentary [word illegible]
(unstable equilibrium) or if, on the contrary, this equilibrium becomes
more and more stable. This shows that all exchange (mental as well as
biological) between the organisms and the milieu (physical and social)
is composed of two poles: (a) of the assimilation of the given external
to the previous internal structures, and (b) of the accommodation of
these structures to the given ones. The equilibrium between the
assimilation and the accommodation is proportionately more stable the
more the assimilative structures are better differentiated and
coordinated.

It is this equilibrium between the assimilation and accommodation that
seems to explain to us the functioning of the reversible operations.
This occurs, for instance, in the realm of notions of conservation where
the [text illegible] account for [text illegible] experience, nor for
the sociolingual transmission. In fact, [text illegible] accommodation)
is [text illegible] a compensation of exterior disturbances through
activities of the subject orientated in the contrary direction of these
disturbances. This leads us directly to the reversibility.

Notice that we do not conceive of 
the idea of equilibrium in the same 
manner as the “gestalt theory” does, 
which makes great use of this idea 
too, but in the sense of an automatical 
physical equilibrium. We believe, on 
the contrary, that the mental equi- 
librium and even the biological one 
presumes an activity of the subject, 
or of the organism. It consists in a 
sort of matching, orientated towards 
compensation—with even some over- 
compensation—resulting from strat- 
egies of precaution. One knows, for 
instance, that the homeostasis does 
not always lead to an exact balance. 
But it often leads to overcompensa- 
tion, in response to exterior disturb- 
ances. Such is the case in nearly all 
occurrences except precisely in the 
case of occurrences of a superior or- 
der, which are the operations of re- 
versible intelligence, the reversible 
logic of which is characterized by a 
complete and exact compensation (in- 
verted operation). 

The idea of equilibrium is so close 
to the one of reversibility that J.
Bruner, in a friendly criticism of one
of our latest books appearing in the 
British Journal of Psychology, pro- 
poses to renounce the idea of equilib- 
rium because the notion of the re- 
versibility seems sufficient to him. We 
hesitate to accept this suggestion for 
the following three reasons: 

First, reversibility is a logical idea while the equilibrium is a causal
idea which permits the explanation of reforms by means of a
probabilistic schema. For instance, in order to explain the formation of
the idea of conservation, one can distinguish a certain number of
successive stages, of which each is characterized by the "strategy" of a
progress of compensation. Now it is possible to show³ that
the first of these strategies (only 
bearing upon one dimension, to the 
neglect of others) is the most probable 
at the point of departure, and further, 
that the second of these strategies 
(with the emphasis on a second di- 
mension) becomes the most likely—as 
a function of the result of the first. 
And, finally, that the third of these 
strategies (oscillation between the ob- 
served modifications upon the differ- 
ent dimensions and the discovery of 
their solidarity) becomes the most 
likely in the functioning of the results 
of the preceding, etc. From such a
point of view the process of equilibra- 
tion is, therefore, characterized by a 
sequential control with increasing 
probabilities. It furnishes a beginning 
for causal explanations of the reversi- 
bility and does not duplicate the for- 
mer idea. 

Secondly, the tendency of equilib- 
rium is much broader for the opera- 
tion than the reversibility as such, 
which leads us to explain the reversi- 
bility through the equilibrium and not 
the reverse. In effect, it is at this 
level of the obvious regulations and 
sensory-motor feedbacks that the 
process of equilibration starts. This in its higher form becomes
intelligence. Logical reversibility is therefore conceivable as an end
result and not as a
beginning and the entire reversibility 
follows the laws of a semireversibility 
of various levels. 

Thirdly, the tendency to equilibrate does not only explain this final
reversibility, but also certain new syntheses between originally
distinct operations. One can cite in this regard an example of great
importance: the series of whole numbers. Russell and Whitehead have
tried to explain the basic set of numbers through the idea of equivalent
classes, without reference to the serial order. This means that two
classes are believed to be equivalent, if one can put their respective
elements into a reciprocal arrangement. Only when this relationship
relies on the quality of the objects (an A put into relation with an A,
a B with a B, etc.) one does not get quantity. If this relationship is
made exclusive of the qualities (an individual A or B put into
relationship with an individual B or A) then there exists only one way
to distinguish the elements from each other. In order not to forget one,
or not to count the same twice, one must deal with them in succession
and introduce the serial factor as well as the structure of classes. We
may then say, psychologically speaking, that the sequence of whole
numbers is a synthesis of two groupings qualitatively distinct, the
fitting of the classes and serialization, and that this synthesis takes
place as soon as one excludes the qualities of the elements in question.
But how does this synthesis occur? Precisely by a gradual process of
equilibration.

³ Logique et Equilibre, Vol. 2 of Etudes d'Epistemologie Genetique.

On the one hand the child who develops his ideas from numbers is in
possession of structures enabling him to fit them into classes
(classifications). But if he wants to be exclusive of qualities in order
to answer to the question "how many," he becomes unable to distinguish
the elements. The disequilibrium which appears, therefore, obliges the
child to resort to the idea of order and take recourse to arranging
these elements into a lineal row. On the other hand, if the child
arranges the elements as 1, 1, 1, etc., how would he know, for instance,
how to distinguish the second from the third? This new disequilibrium
brings him back to the idea of classification: The "second" is the
element which has one predecessor, and the "third" is the one that has
two of them. In short, every new problem provokes a disequilibrium
(recognizable through types of [word illegible] errors) the solution of
which consists in a re-equilibration which brings about a new original
synthesis of two systems, up to [text illegible] of independence.

In the discussion of my theories, Bruner has said that I have called
equilibrium what others describe as motivation. This is perfectly true,
but the advantage of this language is to clarify that a cognitive or
dynamic structure is never independent of motivational factors. The
motivation in return is always solidary to structural (therefore
cognitive) determined level. The language of the equilibrium presents
that activity, that permits us to reunite into one and the same totality
those two aspects of behavior which always have a functional solidarity
because there exists no structure (cognition) without an energizer
(motivation) and vice versa.


(Received May 2, 1961) 


Journal of Educational Psychology
1961, Vol. 52, No. 6, 282-285


NEED ACHIEVEMENT SCALES AS PREDICTORS
OF ACADEMIC SUCCESS


MERVILLE C. SHAW 
Chico State College 


The purpose of the present study was to determine whether or not three
of the most widely used scales of need achievement would make
appropriate differentiations between groups of high school students
matched for ability but differing significantly in mean grade point
average. The three scales used in the present research were the Need
Achievement scale of the Edwards Personal Preference Schedule, the
McClelland Achievement Motivation Test (MAMT), and the French (1956)
Achievement [text illegible] grade point average while Lowell
[text illegible] significant correlations between the same two
variables.

Another study by Weiss, Wertheimer, and Groesbeck (1960) reports
significant correlations between [text illegible] averages of 60 college
students in introductory psychology. Atkinson (1950) reports a negative
but insignificant relationship between MAMT results and grade point
average of college students. A study by Morgan (1952) was similar in
design to the present study in that two groups of subjects, matched for
ability, but differing significantly in grade point averages, were
compared on an MAMT. The t test yielded a difference between the two
groups on the MAMT which was significant at the .02 level.

PROCEDURE


Subjects were selected from the junior and senior classes of a high
school of [number illegible] students. All students who achieved an IQ
above 110 on the California Test of Mental Maturity were considered
potential subjects for inclusion in the sample. Students who met the
ability criterion and who achieved a grade point average in the previous
year of 2.00 (B) or better were classified as achievers. Those who
achieved a grade point average of 1.5 or below were classified as
underachievers. A total of [number illegible] male achievers and 21
female achievers participated. Twenty male underachievers and 19 female
underachievers, likewise, cooperated. All subjects were administered the
McClelland Achievement Motivation Test, the Edwards Personal Preference
Schedule, and the French Test of Achievement Motivation.

The MAMT was administered to groups through use of an opaque projector
under neutral conditions. Cards 7 BM and [card number illegible] from
the TAT and Cards D and H of McClelland's series were the picture
stimuli. Subjects responded to the four basic questions used by
McClelland. Each response was scored twice by the author according to
the criteria described by McClelland et al. (1953, pp. 107-138). The
achievement level of subjects was not known to the scorer at the time
protocols were evaluated.

Because of the possibility of low inter-scorer reliability on the French
test, three judges were used in the scoring procedure. A given scoring
was considered accurate only when at least two of the three judges
responded in identical ways. Judges were not aware of the achievement
level of the subjects they were judging.

For purposes of statistical analysis, male and female groups were
considered separately. Variance ratio (F) and t tests were
applied to the scores of achiever and underachiever groups on
each of the three need achievement measures used in order to
establish the existence of homogeneity of variance and to
determine the significance of differences between groups on these
measures. In addition, Spearman rank-order correlations between
each of the need achievement measures were computed for each of
the subgroups in the study in order to determine the extent to
which the different need achievement measures were measuring the
same variable. Finally, an analysis of variance was carried out
using the pictures, achievement level, and sex as differentiating
criteria.²


TABLE 1
SIGNIFICANCE OF DIFFERENCE BETWEEN MALE AND FEMALE ACHIEVERS
AND UNDERACHIEVERS ON THE CALIFORNIA TEST OF MENTAL MATURITY

         CTMM IQ means
Group  | Achievers | Underachievers | F     | t
Male   | 116.1     | 115.7          | 1.108 | 1.756
Female | 117.0     | 116.7          | 1.796 | 1.205
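The variance ratio (F) and t tests named in the procedure can be sketched as follows. This is a minimal illustration, not the author's computation: the scores are hypothetical, the F ratio is taken as the larger sample variance over the smaller, and the t test is assumed to be the pooled-variance form for two independent samples.

```python
from math import sqrt
from statistics import mean, variance


def variance_ratio(a, b):
    """Return F as the larger sample variance divided by the smaller."""
    va, vb = variance(a), variance(b)
    return max(va, vb) / min(va, vb)


def pooled_t(a, b):
    """Student's t for two independent samples, assuming equal variances."""
    na, nb = len(a), len(b)
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(pooled * (1 / na + 1 / nb))


# Hypothetical need achievement scores, not data from the study.
achievers = [9, 7, 10, 8, 6, 9, 11, 7]
underachievers = [6, 5, 8, 4, 7, 6, 5, 7]

f_ratio = variance_ratio(achievers, underachievers)
t_value = pooled_t(achievers, underachievers)
print(f"F = {f_ratio:.2f}, t = {t_value:.2f}")
```

A small F supports treating the two variances as homogeneous, which is the condition the pooled t test relies on.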


Results


Results reported in Table 1 indicate that the groups did not
differ in performance on the California Test of Mental Maturity.
Analysis of the differences between achiever and underachiever
groups on each of the three measures of need achievement is
reported in Table 2. The only significant difference found was
between male achievers and underachievers on the French Need
Achievement scale. In none of the other comparisons was the
difference between groups found significant.

Computation of correlations between each of the three scales for
each group used in the study revealed [...] significant
correlations. Table 3 reports these results.

² The author would like to express his appreciation to [...],
Department of Psychology, [...] State [...], for assistance in
computing the analysis of variance.




TABLE 2
SIGNIFICANCE OF DIFFERENCES BETWEEN ACHIEVER AND UNDERACHIEVER
GROUPS IN THE MAMT, n ACH SCALE OF THE EPPS, AND FRENCH n ACH SCALE

                          Mean score on criterion
Test                    | Achievers | Underachievers | F    | t
MAMT
  Male                  | 5.3       | 3.5            | 1.50 | 1.3503
  Female                | 4.86      | 3.7            | 1.46 | .8028
EPPS Need Achievement
  Male                  | 16.3      | 14.6           | 1.78 | 1.5889
  Female                | 12.3      | 12.1           | 1.17 | .1923
French Need Achievement
  Male                  | 8.4       | 5.9            | 1.41 | 2.3640*
  Female                | 7.6       | 7.2            | 1.62 | .5677

*p < .05.
TABLE 3
RANK-ORDER CORRELATIONS BETWEEN EACH OF THE NEED ACHIEVEMENT
SCALES FOR EACH GROUP INCLUDED IN THE STUDY

Group             | EPPS & MAMT | EPPS & French | MAMT & French
Males
  Achievers       | .12         | .51*          | .25
  Underachievers  | -.03        | .26           | .09
Females
  Achievers       | .14         | -.15          | .28
  Underachievers  | -.23        | -.13          | .20

*p < .05.
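For readers who want to see how a rank-order correlation of the kind reported in Table 3 is obtained, the sketch below ranks two sets of scores and correlates the ranks. It is illustrative only: the scores are hypothetical, and tied values are given the mean of the ranks they occupy, which is a common convention that the article does not describe.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of positions i..j, counted from 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)


# Hypothetical scores for six subjects on two scales (not study data).
epps = [16, 12, 18, 14, 11, 15]
french = [8, 5, 9, 6, 7, 4]
rho = spearman_rho(epps, french)
print(f"rho = {rho:.2f}")
```

With no ties this agrees with the familiar formula 1 - 6Σd²/(n(n² - 1)), where d is the difference between a subject's two ranks.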


The analysis of variance revealed 
that variation was highly significant 
(.001) among the four separate pic- 
tures of the MAMT but not signifi- 
cant at the .05 level between either 
achievers and underachievers or be- 
tween males and females. The same 
lack of significance was found to exist
between all interaction variances as well.




Discussion 


Results obtained from the analysis 
of differences between achievers and 
underachievers on the MAMT and 
the EPPS Need Achievement scales 
would seem to indicate that neither of 
these instruments would have had 
predictive value for the groups in- 
cluded in this study. The French Need 
Achievement scale did reveal a differ- 
ence significant at the 5% level for 
male members of the sample. 

The single significant correlation 
was found between the Need Achieve- 
ment scale of the EPPS and the 
French Need Achievement scale for 
male achievers. These three scales not 
only tend to be poor predictors of 
academic achievement for this group 
but are not measuring the same vari- 
able. 

Results of the analysis of variance 
of the MAMT indicate that the great- 
est source of variance exists between 
pictures rather than between groups 
of achievers and underachievers, be- 
tween sexes, or between interaction 
variances. It would appear that for 
the groups used in this study the four 
pictures of the MAMT were not all 
measuring the same qualities.

It would not be reasonable to con- 
clude on the basis of this single study 
that present need achievement scales 
are not predictive of academic attain- 
ment, but the fact that other studies 
also have frequently failed to find any 
relationship between need achieve- 
ment scales and academic perform- 
ance signifies the need for critical 
re-examination, not only of need 
achievement instruments, but of the 
concept of need achievement itself.
The basic assumption underlying the idea of the “need to achieve”
is that it is a quantity which some people have in greater degree
than do others, and that quantitative measures of this variable
will provide an index to the individual’s need to achieve in all
areas. It would appear that three fallacies grow out of this
assumption. The first fallacy is that the need to achieve is
constant for all areas of an individual’s life space. The second
fallacy is that the need to achieve results in observable
production of a socially desirable sort. The third fallacy
involved in the basic assumption of need achievement theory is
that the need to achieve and behavior are congruent. The fact
that the intervening variables in the life situation of a given
individual may enter into a situation to prevent an individual
from meeting any need to achieve he might feel is thus
overlooked.

Motives have both force and direction. Present measures of need
achievement consider only the former while neglecting the latter.
If one is willing to define motivation as the amount of energy
expended, then the concept embodied in the idea of “need to
achieve” is legitimate. If one believes that motivation cannot be
measured in terms of energy expended nor amount of production,
then the concept of the “need to achieve” will have limited
usefulness.


Summary


Subjects matched for ability were classified as achievers or
underachievers on the basis of their academic grades. After
completing the McClelland Achievement Motivation Test, the French
Need Achievement scale, and the Edwards Personal Preference
Schedule, groups of male and female achievers and underachievers
were compared. Results indicated that none of the three need
achievement scales differentiated achievers and underachievers
significantly, with the exception of the French scale, which did
differentiate between male achievers and underachievers. The only
significant correlation was found between the French and the
Edwards need achievement scores for male achievers. Analysis of
variance indicated that the greatest source of variance existed
among the pictures, rather than between groups of achievers and
underachievers or between sexes. Implications of these findings
as they related to the concept of need achievement were
presented.


REFERENCES 


Atkinson, J. W. Studies in projective measurement of achievement
motivation. Unpublished doctoral dissertation, University of
Michigan, 1950.

Edwards, A. L. Manual for the Edwards Personal Preference
Schedule. New York: Psychological Corporation, 1954.

French, Elizabeth. Development of a measure of complex
motivation. USAF Personnel Train. Res. Cent. res. Rep., 1956,
No. TN-56-48.

Lowell, E. L. The effect of need for achievement on learning and
speed of performance. J. Psychol., 1952, 33, 31-40.

McClelland, D. C., Atkinson, J. W., Clark, R. A., & Lowell, E. L.
(Eds.) The achievement motive. New York:
Appleton-Century-Crofts, 1953.

Morgan, H. H. A psychometric comparison of achieving and
nonachieving college students of high ability. J. consult.
Psychol., 1952, 16, 292-298.

Weiss, P., Wertheimer, M., & Groesbeck, B. Achievement
motivation, academic aptitude and college grades. Educ. psychol.
Measmt., 1960, 20, 663-665.


(Received February 20, 1961) 


Journal of Educational Psychology 
1961, Vol. 52, No. 6, 286-291 


THE EFFECTS OF DIRECT AND INDIRECT TEACHER
INFLUENCE ON DEPENDENT-PRONE STUDENTS
LEARNING GEOMETRY¹


EDMUND AMIDON
Temple University

AND

NED A. FLANDERS
University of Michigan


Whether or not a particular type of student can learn when he is
exposed to a particular style of teaching has interested a number
of researchers. Smith (1955) and Wispe (1951) have both shown
that when students are classified by the use of personality test
data, they respond differently to highly organized versus loosely
organized classroom activities in a college remedial reading
course (Smith, 1955) and to college lecturing versus group
discussion techniques (Wispe, 1951) in freshman sociology. The
present project was concerned with dependent-prone eighth grade
students who were exposed to consistently direct versus indirect
styles of teaching while learning geometry.

Asch (1951), Kagan and Mussen (1956), and Livson and Mussen
(1957) have studied the reactions of dependent-prone persons in
various kinds of experimental situations. They concluded that
dependent-prone individuals are more likely to comply with
authority figures and conform to group pressures than the less
dependent-prone. Their results suggest that a dependent-prone
student might become overly concerned with following the
suggestions and directions of the teacher and more dependent on
support and encouragement. The present project was designed to
find out if these concerns inhibit or enhance the learning of
geometry at the age level represented in the eighth grade.


1 This article is based on the PhD thesis of the first author. It
was an adjunctive study of a larger project, the latter supported
by a grant from the United States Office of Education.


Procedure


This study employed a laboratory design in order to exercise
experimental control of spontaneous behavior. First, the behavior
of the teacher was controlled by training a teacher-role player.
His statements were classified by an observer to demonstrate that
desired differences were great enough for students to notice.
Second, those students who scored high on a dependence-proneness
test, developed by Flanders, Anderson, and Amidon (1960), were
selected for the experimental population. Third, control of
learning was accomplished by giving pre- and posttests of
geometry achievement. And fourth, in half of the experimental
groups the basic content material, presented in a tape recording,
was so organized that the immediate learning goals were unclear,
and in the other half the immediate learning goals were clear.

The four treatments involved were:

Treatment 1. Direct teacher influence, clear goals, 35
dependent-prone students

Treatment 2. Direct teacher influence, unclear goals, 35
dependent-prone students

Treatment 3. Indirect teacher influence, clear goals, 35
dependent-prone students

Treatment 4. Indirect teacher influence, unclear goals, 35
dependent-prone students

The experimental population of 140 dependent-prone students was
part of a larger group of 560 students. The larger group was
exposed to the preceding four treatments in groups of 20 as part
of a larger study. The 560 eighth grade students were selected at
random from Minneapolis and St. Paul public schools. The 140
students were the top 25% in each treatment of the larger
population according to their scores on the dependence-proneness
test.
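The selection rule just described (the top 25% of each treatment group on the dependence-proneness test) can be sketched as below. The scores, the score range, and the student identifiers are all hypothetical, and the way ties at the cut-off are broken is an assumption, since the article does not say how they were handled.

```python
import random


def top_quarter(scores_by_student):
    """Return the IDs of the highest-scoring 25% of students."""
    ranked = sorted(scores_by_student, key=scores_by_student.get, reverse=True)
    return ranked[: len(ranked) // 4]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
# Hypothetical scores: 4 treatments x 140 students = 560 students.
treatments = {
    t: {f"t{t}_s{i}": rng.randint(0, 45) for i in range(140)}
    for t in (1, 2, 3, 4)
}
selected = {t: top_quarter(scores) for t, scores in treatments.items()}
total_selected = sum(len(ids) for ids in selected.values())
print(total_selected)
```

Taking a quarter of 140 students in each of the four treatments gives 35 per treatment and 140 in all, which matches the group sizes stated in the text.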

The students were brought, 20 per session, to a spare room in a
public school. First, a pretest of geometry achievement and a
test of dependence-proneness were administered. Second, a tape
recording was played introducing the basic concepts of
Distance = Speed × Time, and geometric concepts and formulae
concerning inner angles. In half of the groups the immediate
goals were made clear because the tape explained how this
information could be used to solve problems; in the other half,
the goals were less clear because the students were warned that
they could not be told how this information could be used. Third,
in the direct treatments a teacher gave a 15-minute lecture, with
a few questions, explaining the material and illustrating
problems that could be solved. In the indirect treatments, a
teacher conducted a discussion explaining the material and
illustrating problems that could be solved. The content coverage
was the same in the contrasting treatments. Fourth, the students
then had about 15 minutes to practice solving problems at their
seats by working on a problem sheet. And fifth, the posttest of
achievement was administered. The entire sequence lasted 2 hours.
At appropriate points in this procedure the students’ perceptions
of goal clarity and their perceptions of the teacher were
measured by paper-and-pencil tests. The reliability of both of
these scales was estimated by use of the Hoyt-Stunkard (1952)
analysis of variance technique. The estimated reliability of the
student perception scale was .64, while the reliability found for
the goal perception measure was .93 for the scale administered
just before the discussion (Amidon, 1959). Whenever the teacher
talked, an observer classified all teacher and student statements
according to Flanders’ (1960) system of interaction analysis.
Later the validity and reliability of the observer’s judgments
were verified by studying the tape recording that was made of
every experimental session.
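The article names an analysis of variance technique for estimating scale reliability but does not give the computation. A minimal sketch, assuming the familiar Hoyt form in which reliability is one minus the ratio of the residual mean square to the between-persons mean square, is shown below; whether the 1952 Hoyt-Stunkard procedure differs in detail is not stated here, and the item scores are hypothetical.

```python
def hoyt_reliability(scores):
    """Estimate reliability from a persons x items table of scores.

    scores[p][i] is the score of person p on item i.
    """
    n_persons, n_items = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n_persons * n_items)
    person_means = [sum(row) / n_items for row in scores]
    item_means = [sum(row[i] for row in scores) / n_persons for i in range(n_items)]

    # Partition the total sum of squares into persons, items, and residual.
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_persons = n_items * sum((m - grand) ** 2 for m in person_means)
    ss_items = n_persons * sum((m - grand) ** 2 for m in item_means)
    ss_residual = ss_total - ss_persons - ss_items

    ms_persons = ss_persons / (n_persons - 1)
    ms_residual = ss_residual / ((n_persons - 1) * (n_items - 1))
    return 1 - ms_residual / ms_persons


# Hypothetical responses of four students to three scale items.
example = [[3, 4, 3], [2, 2, 1], [4, 5, 5], [1, 2, 2]]
reliability = hoyt_reliability(example)
print(f"{reliability:.3f}")
```

Because items are allowed any numeric score, the same function applies to graded opinion items as well as right-wrong items.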
Results

Control of Teacher Influence and Goal Perception

The manipulation of direct and indirect teacher influence
occurred after the tape recording when the students came in
contact with the teacher. The same role player acted as teacher
in all treatments to avoid differences in personality and
appearance. The difference between the direct and indirect
approach is shown in Table 1 according to the percentage of
statements classified into interaction categories.


TABLE 1
PERCENTAGE OF TALLIES IN INTERACTION CATEGORIES

                                            Treatment
Category definition                       | 1     | 2     | 3     | 4
Teacher talk:
  Praise and encouragement                | 1.35  | 1.61  | 17.04 | 14.90
  Clarification and development of
    ideas suggested by students           | 2.48  | 0.92  | 15.78 | 16.10
  Asks questions                          | 2.58  | 1.78  | 28.07 | 30.04
  Gives own opinions and facts (lectures) | 63.10 | 61.40 | 13.52 | 15.97
  Gives directions                        | 8.67  | 10.36 | 0.28  | 0.27
  Criticizes students                     | 13.03 | 15.54 | 1.27  | 0.94
Student talk                              | 5.07  | 5.29  | 16.47 | 17.17
No one talking                            | 3.49  | 3.45  | 7.75  | 4.69
Total tallies on which the percentage
  figures are based                       | 889   | 869   | 711   | 746

The figures in Table 1 show that 
essential differences between the di- 
rect and indirect treatments are: the 
teacher lectures and gives more direc- 
tions in the direct treatments; he asks 
more questions and gets more student 
participation in the indirect treat- 
ments; he praises, encourages, and 
clarifies student ideas more frequently 
in the indirect treatments; and he 
criticizes students more frequently in 
the direct treatments. 
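The contrast described in this paragraph can be checked by adding up the Table 1 percentages within each treatment. In the sketch below the percentages are copied from Table 1, but the grouping of categories into "indirect" and "direct" teacher talk is an assumption made for illustration, following the wording of the paragraph rather than any formula given by the authors.

```python
# Percentages copied from Table 1; columns are Treatments 1 to 4.
table_1 = {
    "praise": (1.35, 1.61, 17.04, 14.90),
    "clarifies ideas": (2.48, 0.92, 15.78, 16.10),
    "asks questions": (2.58, 1.78, 28.07, 30.04),
    "lectures": (63.10, 61.40, 13.52, 15.97),
    "gives directions": (8.67, 10.36, 0.28, 0.27),
    "criticizes": (13.03, 15.54, 1.27, 0.94),
}
# Assumed grouping of the teacher-talk categories.
INDIRECT = ("praise", "clarifies ideas", "asks questions")
DIRECT = ("lectures", "gives directions", "criticizes")


def share(categories, treatment):
    """Sum the Table 1 percentages of the given categories for one treatment (1-4)."""
    return sum(table_1[c][treatment - 1] for c in categories)


for t in (1, 2, 3, 4):
    print(f"Treatment {t}: indirect {share(INDIRECT, t):.2f}%, "
          f"direct {share(DIRECT, t):.2f}%")
```

On these figures the assumed direct categories account for roughly 85% of tallies in Treatments 1 and 2 and about 15 to 17% in Treatments 3 and 4.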

TABLE 2
STUDENT PERCEPTION OF TEACHER INFLUENCE

Group              | Mean  | Fobs between direct and indirect groups
Direct influence   | 11.74 | 78.40
Indirect influence | 18.84 |


The fact that the teacher controlled his behavior successfully
and created the two teacher styles is self-evident. A Darwin
(1959) chi square analysis of these same interaction data, after
they were tabulated in a matrix of sequence pairs, was calculated
to test the null hypothesis that there is no difference between
interaction data of the direct and indirect combined treatments.
The chi square value found was [...]2 (df = 90). This value,
transformed to a z score of 24.1, indicates that the differences
could have occurred by chance with a frequency of much less than
.01.
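The text does not say how the chi square value was transformed to a z score. One standard choice for large degrees of freedom is Fisher's approximation, z = sqrt(2·chi square) - sqrt(2·df - 1); the sketch below assumes that this is the transformation meant, which the source does not confirm, and inverts it to see what chi square a z of 24.1 with df = 90 would imply.

```python
from math import sqrt


def chi2_to_z(chi2, df):
    """Fisher's large-df approximation: z = sqrt(2*chi2) - sqrt(2*df - 1)."""
    return sqrt(2 * chi2) - sqrt(2 * df - 1)


def z_to_chi2(z, df):
    """Invert the approximation to recover the chi square that gives z."""
    return (z + sqrt(2 * df - 1)) ** 2 / 2


# Values taken from the text: z = 24.1 with 90 degrees of freedom.
implied_chi2 = z_to_chi2(24.1, 90)
print(f"implied chi square = {implied_chi2:.1f}")
```

Under this assumption the implied chi square is a little over 702. That is offered only as a consistency check on the reported z score, not as a recovery of the figure printed in the article.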

The reliability of the observers who 
classified the statements in the live 
situation was higher than .90. It can
be shown that the errors of observa- 
tion, at this level of reliability, are 
extremely small compared with the 
difference shown in Table 1 between 
the direct and indirect patterns. 

After the lectures and discussions and before the period devoted
to work sheets, the students responded to a number of opinion
items combined into a scale which measured their perceptions of
the teacher’s behavior. An analysis of variance was made of these
scale scores which indicated that the F ratio between the groups
subjected to the direct and indirect influence treatments was
78.4 (df = 1/136). This was significant at the .01 level of
confidence. The mean scores indicated that students in the direct
treatments more often saw the teacher as: “telling us what to
do,” “firm and businesslike,” “making plans for us,” “critical of
our ideas,” “talking more than the students,” and “not using
student ideas or suggestions.” In the indirect treatments,
students often marked: “finding out what we know,” “relaxed and
cheerful,” “letting us make our own plans,” “letting students
talk,” “using our ideas,” and similar perceptions that were the
opposite of the direct pattern. The means and F ratio are
presented in Table 2.
As expected, an analysis of interaction data and of student
perceptions of the teacher comparing the clear and unclear goal
treatments revealed no significant (p < .05) differences in
teacher behavior. Also, no significant (p < .05) interaction
effects were found in the analysis of variance.

The interaction analysis and measures of the students’
perceptions of the teacher did show that the differences between
the direct and indirect approach did exist as required by the
experimental design, were clearly seen by a trained observer, and
noticed by the students.

Paper-and-pencil measures of the clarity of goals were made after
the playback of the tape recording introducing the basic
geometric concepts. Students responded to items such as: “Can you
see all the steps needed to finish your work?” “Right now can you
see what you will be doing clearly?” “Can you picture your work
so clearly that you could tell when you will be finished?” and
similar items that were combined into a scale. An analysis of
variance of these scale scores yielded an F ratio of [...]
(df = 1/136, p < .01). An inspection of the means indicated that
the results were consistent with the intended goal manipulation.
The means show that the immediate response of students to the
clear and unclear tape recording was significantly different. The
means and F ratio are found in Table 3.


Results of the Geometry Achievement Test


Since achievement in geometry was the fundamental outcome
variable analyzed in this study, the postachievement test was
subjected to several analyses. The first analysis was the
comparison of postachievement scores of the indirect and direct
teacher influence groups. The F ratio found in this analysis was
7.67, which is significant at the .01 level. The means of the
indirect teacher influence groups were significantly higher than
the means of the direct teacher influence groups on the
postachievement measure. In order to reduce unaccounted-for error
in the analysis of postachievement, two analyses of covariance
using preachievement scores and intelligence scores were run. The
F ratio between direct and indirect teacher influence groups was
10.03 when intelligence was controlled, and 9.62 when pretest
scores were controlled. Both of these F ratios were significant
at the .01 level, indicating the superiority of the indirect
teacher influence group. For each of the analyses of covariance
df = 1/187.

The analysis of variance and covariance just discussed yielded
insignificant (p > .05) results when the clear and unclear goal
perception groups were compared statistically. The interaction of
goal perception and teacher influence also did not yield
significant (p < .05) results. The F ratios found in these
analyses are presented in Table 4.

The means of the indirect group were significantly higher than
the means of the direct group. This is true when intelligence and
preachievement were controlled, and it was also true when they
were not controlled. The means of the direct and indirect teacher
influence groups are presented in Table 5.


TABLE 3
STUDENT PERCEPTION OF THE GOAL

Group           | Mean  | Fobs between clear and unclear groups
[...]lear goals | 26.13 | [...]
[...]           | [...] |


TABLE 4
ANALYSIS OF DIFFERENCES IN THE MEANS OF POSTACHIEVEMENT SCORES
WITH VARIOUS FACTORS CONTROLLED STATISTICALLY

Source of variation          | df | Fobs  | Null hypothesis
With no measure controlled:
  Interaction                | 1  | -     | Not rejected
  Teacher influence          | 1  | 7.67  | Rejected
  Goal perception            | 1  | -     | Not rejected
Preachievement controlled:
  Interaction                | 1  | 2.71  | Not rejected
  Teacher influence          | 1  | 10.03 | Rejected
  Goal perception            | 1  | -     | Not rejected
Intelligence controlled:
  Interaction                | 1  | 1.08  | Not rejected
  Teacher influence          | 1  | 9.62  | Rejected
  Goal perception            | 1  | 1.13  | Not rejected


TABLE 5
MEANS OF DIRECT AND INDIRECT GROUPS FOR POSTACHIEVEMENT MEASURE

Mean           | Not adjusted | Adjusted by pretest scores | Adjusted by intelligence
Direct group   | 9.24         | 9.31                       | 9.30
Indirect group | 10.82        | 10.76                      | 10.69
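Adjusted means of the kind listed in Table 5 are conventionally obtained in an analysis of covariance by moving each group's outcome mean along the pooled within-group regression line to the grand mean of the covariate. The sketch below shows that adjustment for one covariate. The data are hypothetical (pretest, posttest) pairs, and the article does not describe its own computing procedure, so this is an illustration of the general method only.

```python
from statistics import mean


def adjusted_means(groups):
    """groups maps a label to a list of (covariate, outcome) pairs."""
    all_x = [x for pairs in groups.values() for x, _ in pairs]
    grand_x = mean(all_x)

    # Pooled within-group regression slope of outcome on covariate.
    sxy = sxx = 0.0
    for pairs in groups.values():
        mx = mean(x for x, _ in pairs)
        my = mean(y for _, y in pairs)
        sxy += sum((x - mx) * (y - my) for x, y in pairs)
        sxx += sum((x - mx) ** 2 for x, _ in pairs)
    slope = sxy / sxx

    # Shift each group mean to where it would fall at the grand covariate mean.
    return {
        label: mean(y for _, y in pairs)
        - slope * (mean(x for x, _ in pairs) - grand_x)
        for label, pairs in groups.items()
    }


# Hypothetical (pretest, posttest) pairs, not data from the study.
data = {
    "direct": [(4, 8), (6, 10), (5, 9), (7, 10)],
    "indirect": [(3, 9), (5, 11), (4, 10), (6, 12)],
}
adjusted = adjusted_means(data)
print(adjusted)
```

In this toy example the direct group starts with the higher pretest mean, so its posttest mean is adjusted downward and the indirect group's is adjusted upward.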


Discussion 


The measures of geometry achievement indicate that the
dependent-prone students learned more in the classroom in which
the teacher gave fewer directions, less criticism, less
lecturing, more praise, and asked more questions which increased
their verbal participation. This finding takes on added
significance when compared with Flanders’ (1960) findings that
the total group of 560 students, those scoring high, average, and
low on the dependence-proneness tests, failed to show the same
significant differences under the same conditions. Moreover, when
the independent-prone students (those in the lower 25% on the
dependency scale) were compared separately, no differences were
found. Compared with students in general, dependent-prone
students are apparently more sensitive to the influence pattern
of a geometry teacher.

The authors are disposed to inter- 
pret these findings in terms of the 
probable effects of teacher influence on 
the dependent-prone student. We as- 
sume that dependent-prone students 
are more sensitive to the directive as- 
pects of the teacher’s behavior. As the 
teacher becomes more directive, this 
type of student finds increased satis- 
faction in more compliance, often with 
less understanding of the problem solv- 
ing steps carried out. Only when he is 
free to express his doubts, to ask ques- 
tions and gain reassurance, does his 
understanding keep pace with his com- 
pliance to the authority figure. Lack- 
ing this opportunity, compliance alone 
may become a satisfactory goal and 
content understanding may be subor- 
dinated to the process of adjusting to 
teacher directives. It is interesting to 
note that this effect occurs even when 
the material being learned concerns 
an orderly, logical system, such as 
exists in geometry. 

One implication of this study is that 
closer supervision through the use of 
direct influence, an all too common an- 
tidote to lower achievement, may be 
more harmful than helpful for depend- 
ent-prone students. 


Summary


The primary purpose of the study described here was to determine
the effects of direct vs. indirect teacher behavior and of clear
vs. unclear student perception of the learning goal on the
achievement of eighth grade geometry students. A specially
trained teacher role played both a very direct and a very
indirect teacher in a laboratory situation involving 140 eighth
grade pupils chosen from a larger population on the basis of high
scores on a test of dependency proneness. The students were
randomly assigned to one of the following four experimental
treatments: direct teacher influence with clear goals, direct
teacher influence with unclear goals, indirect teacher influence
with clear goals, and indirect teacher influence with unclear
goals.

Students in the various classifications were then compared on the
basis of pre- and postachievement tests in geometry. No
differences were found between the clear goal and unclear goal
treatments, indicating that in this study, at least, achievement
of dependent-prone students was not affected by perception of the
learning goal. An analysis of the direct and indirect treatments
indicated that the children taught by the indirect teacher
learned more than did the children taught by the direct teacher.

The results of this study take on additional meaning when
compared with the results of Flanders (1960). Using the same
experimental design, Flanders found no differences (among the
four experimental conditions) for the total group of 560 students
who ranged from very high to very low on the dependence scale.
Apparently dependent-prone students are more sensitive to types
of teacher influence than are independent-prone students or
students who make average scores on the test for dependence
proneness.



REFERENCES 


Amidon, E. Dependent-prone students in experimental learning
situations. Unpublished doctoral dissertation, University of
Minnesota, 1959.

Asch, S. E. Effects of group pressure upon the modification and
distortion of judgments. In H. Guetzkow (Ed.), Groups, leadership
and men. Pittsburgh: Carnegie Press, 1951.

Darwin, J. H. Note on the comparison of several realizations of a
Markoff chain. Biometrika, 1959, 46, 412-419.

Flanders, N. A. Teacher influence on pupil attitudes and
achievement. Final Report, 1960, University of Minnesota; [...]
United States Department of Health, Education, and Welfare;
Cooperative Research Program; Office of Education.

Flanders, N. A., Anderson, J. P., & Amidon, E. Measuring
dependence proneness in the classroom. U. Minn., Bur. Educ. Res.
res. Memo., 1960, No. BER 60-6.

Hoyt, C., & Stunkard, C. L. Estimation of test reliability for
unrestricted item scoring methods. Educ. psychol. Measmt., 1952,
12, 756-758.

Kagan, J., & Mussen, P. Dependency themes on the TAT and group
conformity. J. consult. Psychol., 1956, 20, 19-27.

Livson, N., & Mussen, P. The relation of control to overt
aggression and dependency. J. abnorm. soc. Psychol., 1957, 55,
66-71.

Smith, D. E. P. Fit teaching methods to personality structure.
High sch. J., 1955, 39, 167-171.

Wispe, L. C. Evaluating section teaching methods in the
introductory course. J. educ. Res., 1951, 45, 161-186.


(Received March 13, 1961) 


Journal of Educational Psychology 
1961, Vol. 52, No. 6, 292-296 


THE INFLUENCE OF INTERRACIAL CONTACT ON 
SOCIAL ACCEPTANCE IN A NEWLY 
INTEGRATED SCHOOL¹


STATEN W. WEBSTER²


University of California, Berkeley 


Does interracial contact lead to im- 
proved attitudes and increased social 
acceptance among the participants? 
This question has been the subject of 
a number of investigations, most of 
which have involved white and Negro 
adult subjects. 

Data reported in these studies have 
been contradictory. A number of in- 
vestigations (Brophy, 1945; Deutsch 
& Collins, 1952; Neprash, 1952; Wil- 
ner, Walkley, & Cook, 1952) have in- 
dicated the successful influence of in- 
terracial contact in reducing prejudice. 
However, a number of negative find- 
ings have also been reported (Mussen, 
1950; Rose, 1947; Young, 1932). 

Those who argue against the posi- 
tive effects of contact cite the restric- 
tive influence of racial and ethnic 
group norms upon individuals in such 
situations. The existence of such 
frames of reference has been clearly 
documented by Bogardus (1959). 
While the impact of racial or ethnic 
group membership upon the individual 
has been established, there is also evi- 
dence that under certain conditions 
individuals and groups do modify their 
interracial attitudes, feelings, and behaviors. An analysis of
successful contact situations seems to reveal three important
requisite factors: equal status of the participants in the
situation, contact of sufficient duration and intimacy, and the
sanction or support of a higher authority.

1 This study is abstracted from S. W. Webster’s doctoral
dissertation written in partial fulfillment of the requirements
for the degree of Doctor of Philosophy at the University of
California, Berkeley, 1960.

2 Now Associate Research Director of the project, Studies in the
Dynamics of Administrative Interaction, at the School of
Education, University of California, Berkeley.


PROCEDURE 
Problem 


In view of the dearth of contact research involving public school
settings, this investigation sought to study the effects of
interracial contact in a newly integrated northern California
junior high school.

Specifically, two major questions were investigated: Does
interracial contact, over a 6-month period, between white and
Negro subjects, lead to increased expressions of social
acceptance of one group for the other? Are precontact social
acceptance scores (as measured by a social acceptance scale)
associated with the eventual making of interracial friendships?

Setting. The school board of a school district in the San
Francisco Bay Area reached a decision in the spring of 1959 to
effect certain changes in the boundaries surrounding its junior
and senior high schools. These realignments were made necessary
by overcrowded conditions in some schools, and the unequal
distribution of children in terms of socioeconomic status.

Prior to the boundary modifications, which became effective in
September 1959, Jackson Junior High School* was populated by an
all white student body. The composition of the school population
was altered greatly by the boundary changes, with some 300 Negro
children being integrated into the school’s present total of some
1,400 students. This school, its feeder elementary schools, and
its surrounding communities served as the setting for this study.
While there were some socioeconomic differences between the white
and Negro communities, a study of the personal histories of the
children revealed these to be minor in nature.

Subjects. When the new school boundaries were announced late in
the spring, it became obvious that most children in two
elementary schools would be attending, in the fall, a truly
integrated school for the first time. Wal[...]er Elementary
School, which was all white, sent 60 sixth grade graduates to
Jackson Junior High in September. From the almost all Negro
Newell School came 44 transfers. (This school contained, in
addition to some [...]05 Negro students, 17 children of Indian
and Mexican-American extraction.)

A similar control group comprised of 55 whites and 53 Negroes was
also selected. An all white community and junior high school, in
a neighboring city, provided the white control subjects. The
Negro control subjects were selected from an almost all Negro
junior high school in [...] city; with the exception of [...]
Mexican-American and white children, the latter school’s student
population of 450 was composed completely of Negroes.

* Actual school names are not used in this report.

Hypotheses. Four hypotheses were investigated in this study; they
were as follows:

Hypothesis I: White and Negro subjects who have experienced
intergroup contact in a school situation will express
significantly more social acceptance of each other after a period
of time.

Hypothesis II: The postcontact social acceptance scores of the
white and Negro experimental subjects will display significantly
more outgroup acceptance than will the noncontact scores of the
control subjects.

Hypothesis III: The precontact social acceptance scores of the
experimental subjects will be significantly associated with their
later friendship choices.

Hypothesis IV: The retest social acceptance scores of the
experimental subjects will be significantly associated with their
interracial friendship choices.

These hypotheses are interrelated. Hypotheses I and II seek to
measure the impact of [...] subjects, in [...] control subjects.
Hypotheses [...] data on the effects of [...] the validity of the
social acceptance scale used to gather these data.


Method

The test-retest design of study was deemed most appropriate. Both
experimental groups were tested initially in spring 1959, while
they were still attending their particular elementary schools,
and were retested at the junior high school in March 1960.
Because of the time factor, it was impossible to test the control
groups in spring 1959. The
failure to obtain these data is, admittedly, 
a weakness of the study. However, since their 
elementary and junior high schools were 
relatively homogeneous in racial composi- 
tion, it seems valid to assume that their 
single test scores (spring 1960) on a social 
acceptance scale (see below) would not differ 
significantly from the precontact scores of 
their matched experimental groups. 

All testing sessions were conducted by 
members of the subject’s racial group. To 
avoid the possibility of inhibiting the re- 
sponses of the children, they were told that 
the questionnaires were anonymous. How- 
ever, although the experimenter entertained 
some reservations about the procedure, in- 
visible ink markings were used on the social 
acceptance scale to identify the respondents. 

To measure social acceptance, a nine-item 
social acceptance scale was constructed. The 
respondent was asked to rank his willingness 
to accept members of four groups (Chinese, 
Japanese, Mexicans, and whites or Negroes) 
in nine social situations. The social situations 
ranged from the less intimate (e.g., working 
in the same factory) to the very intimate 
(e.g., living next door). Scale scores from 
0 to 9 were possible, with the higher scores 
representing the greater acceptance. The 
scalability of the instrument was established 
in keeping with the criteria suggested by 
Guttman (1950). The coefficient of repro- 
ducibility was .92. 
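
The coefficient of reproducibility reported above can be illustrated with a short sketch. It assumes dichotomous (0/1) item responses and one common error-counting convention, in which each respondent's pattern is compared with the ideal scale pattern for his total score; the paper does not state which convention was used, and the function name and data are hypothetical.

```python
def reproducibility(responses):
    """Coefficient of reproducibility for rows of 0/1 item endorsements.

    Assumption: errors are counted against the ideal pattern implied by
    each respondent's total score (one of several conventions in use).
    """
    n_items = len(responses[0])
    # Order items from most to least frequently endorsed.
    totals = [sum(row[j] for row in responses) for j in range(n_items)]
    order = sorted(range(n_items), key=lambda j: -totals[j])
    errors = 0
    for row in responses:
        score = sum(row)
        # Ideal pattern: the `score` most popular items endorsed, the rest not.
        ideal = [1 if rank < score else 0 for rank in range(n_items)]
        observed = [row[j] for j in order]
        errors += sum(o != i for o, i in zip(observed, ideal))
    return 1 - errors / (len(responses) * n_items)


# Hypothetical data: three perfect scale types and one deviant pattern.
sample = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 1]]
rep = reproducibility(sample)
```

A value of .90 or higher is the conventional criterion for a scalable set of items, which the reported .92 satisfies.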

In addition to the social acceptance scale, 
a brief sociometric friendship questionnaire 
was used. The subjects were asked to indi- 
cate their six best friends in the school and 
to give the ethnic or racial backgrounds of 
these friends. These data were obtained after 
the retest of the experimental subjects. 

To test the above hypotheses, it was nec- 
essary to validate the assumption that both 
experimental and control groups were ini- 
tially similar in levels of social acceptance. 
The two-tailed application of the Kolmogorov-Smirnov
two-sample test (Siegel, 1956)
was used in this analysis. Since the KS test
is sensitive to differences in central tendency,
dispersion, and skewness, it seemed especially
appropriate for this purpose. The comparison
of the experimental subjects' pretest
scores with the single test scores of

re heey 
ia nee me meee 


Results


Table 1 contains a summary of the
basic test data on the experimental


TABLE 1
Test and Retest Scores of Experimental and Control Groups
on a Social Acceptance Scale

Subjects                  Group    N   Range   Mean   Mode    SD
Experimental:
  Pretest (June 1959)     White   60    0-9    1.85     0    2.32
                          Negro   44    0-9    6.05     9    2.36
  Posttest (March 1960)   White   60    0-7    1.48     0    2.02
                          Negro   44    0-9    6.32     9    2.61
Control (March 1960)      White   55    0-9    2.54     0    2.24
                          Negro   53    0-9    5.98     6    3.97


and control groups. The reader will 
note the great discrepancies between 
the scores of the Negroes and the 
whites. Bogardus (1959) has reported 
similar distributions in one of his stud- 
ies of social distance. 

Hypothesis I. The Wilcoxon
matched-pairs signed-ranks test (Siegel,
1956) was used to test the first
hypothesis. This nonparametric test
makes use of comparisons between
pairs of scores and takes into consideration
both the magnitude and
the direction of the differences between
the sets of scores. Change for the
white subjects was in the negative direction
and significant (z = 3.96, p <
.05). Change for the Negro subjects
was positive and significant (z = 2.12,
p < .05). Thus, the hypothesis
was confirmed only in the case of
the Negro subjects.

An equal number of Negro subjects 
changed in each direction. However, 
the magnitude of change was greater 
for those subjects whose gains were 
positive.
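
The way the signed-ranks test weighs magnitude as well as direction can be sketched as follows. This is an illustrative large-sample approximation, not the author's computation: ties receive average ranks, no tie correction is applied to the variance, and z is signed so that a positive value means the positive changes outrank the negative ones. The function name and scores are hypothetical.

```python
import math


def wilcoxon_z(before, after):
    """Normal approximation to the Wilcoxon matched-pairs signed-ranks test."""
    # Pairs with no change are dropped, as is customary for this test.
    diffs = [a - b for a, b in zip(after, before) if a != b]
    n = len(diffs)
    # Rank the absolute differences, averaging ranks over ties.
    ordered = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[ordered[j + 1]]) == abs(diffs[ordered[i]]):
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[ordered[k]] = average_rank
        i = j + 1
    t_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    mean = n * (n + 1) / 4
    sd = math.sqrt(n * (n + 1) * (2 * n + 1) / 24)
    return (t_plus - mean) / sd


# Hypothetical scores: two subjects rise and two fall, but the gains are larger.
z = wilcoxon_z([5, 5, 5, 5], [8, 9, 4, 3])
```

With equal numbers of subjects changing in each direction, z is still positive here because the larger differences carry the higher ranks, which is the pattern described for the Negro subjects.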

Hypothesis II. The testing of Hy- 
pothesis II involved comparing the 
postcontact scores of the experimental 


+The .05 level of significance was used in 
testing all hypotheses. 


STATEN W. WEBSTER 


groups with those of their controls in
an attempt to see if the contact situation
had produced significant differences
among the groups.
The one-tailed application of the
KS test revealed a significant difference
between the scores of the white
experimental and control groups (χ² =
7.23, df = 2, p < .05); however, the
difference was in the nonpredicted direction.
The obtained difference between
the scores of the Negro experimental
and control subjects was not
significant (χ² = 2.16, df = 2, p > .05).
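
The chi-square values above follow from the large-sample form of the one-tailed KS test given by Siegel (1956), in which χ² = 4D²n₁n₂/(n₁ + n₂) with 2 degrees of freedom. A minimal sketch, with a hypothetical function name and made-up scores:

```python
import math


def ks_one_tailed(x, y):
    """Large-sample one-tailed Kolmogorov-Smirnov two-sample test.

    Returns (D, chi_square, p), where D is the largest amount by which the
    cumulative proportion of x exceeds that of y.
    """
    n1, n2 = len(x), len(y)
    d = 0.0
    for v in sorted(set(x) | set(y)):
        f1 = sum(1 for s in x if s <= v) / n1
        f2 = sum(1 for s in y if s <= v) / n2
        d = max(d, f1 - f2)
    chi_square = 4 * d ** 2 * n1 * n2 / (n1 + n2)
    # For 2 degrees of freedom the chi-square tail area is exp(-chi2 / 2).
    p = math.exp(-chi_square / 2)
    return d, chi_square, p


# Hypothetical scale scores for two small groups.
d, chi_square, p = ks_one_tailed([0, 0, 1, 1], [8, 9, 9, 9])
```

Applied to the reported statistics, exp(-7.23/2) is about .03 and exp(-2.16/2) is about .34, which agrees with the significance statements for the white and Negro comparisons respectively.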
Hypothesis III. This hypothesis is
based on a suggestion by Neprash
(1952). He speculated that the making
of out-group friendships in a contact
situation may be simply a matter of
favorable precontact attitudes toward
that group and not a result of contact.
To test this assumption the precontact
scores of the experimental subjects
were dichotomized at the median
and a comparison was made of the
number of out-group friendships indicated
by each group. The χ² test was
used in the analysis and the findings
were not significant. The median, in
this case, happened to fall close to the
derived zero intensity point (discussed
later).

These findings seem to indicate that,
in this study, precontact attitudes
played little or no part in the willingness
to make out-group friendships in
the contact situation. The making of
interracial friendships, it appears, is
related to the effects of contact itself.
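
The median-split comparison can be pictured as a fourfold table: subjects above and below the median, by whether or not they reported out-group friends. The sketch below computes χ² for such a table, optionally with the continuity correction; the paper does not report the cell frequencies or the exact form of the test, so the layout and the counts are assumptions for illustration only.

```python
def chi_square_2x2(a, b, c, d, correction=True):
    """Chi-square (df = 1) for a fourfold table with cells a, b / c, d."""
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if correction:
        # Continuity correction: reduce |ad - bc| by n/2, not below zero.
        diff = max(diff - n / 2, 0)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Hypothetical counts: rows = above/below median, columns = friends/no friends.
corrected = chi_square_2x2(10, 20, 20, 10)
uncorrected = chi_square_2x2(10, 20, 20, 10, correction=False)
```

With 1 degree of freedom, a value of 3.84 or more is required for significance at the .05 level used throughout the study.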
Hypothesis IV. It was predicted in
Hypothesis IV that the subjects' retest
social acceptance scores would be significantly
associated with their reported
interracial friendships. In testing
this assumption, use was made of
the intensity measure suggested by
Guttman (1954) and his associates.
While the method of deriving this
score will not be fully discussed, a
brief description will be given. The
zero or low intensity point is a measure
which seeks to establish a psychologically
meaningful point of low intensity
on an attitude scale. Through
the use of the extreme responses of the
subjects to each scale item (Answer
1 or 4), it is possible to
divide the respondents into three
groups: intensely positive subjects,
intensely negative subjects, and
indifferent or low intensity subjects.
In the case of the social acceptance
scale used in this study, the zero intensity
point was found to be located at
a scale score of four.

The subjects were dichotomized on
the basis of their intensity scores, and
an analysis was then made of the number
of interracial friendship choices
reported by each group. The obtained
result was significant (χ² = 10.56).
Hypothesis IV was thus confirmed.


Discussion


The positive effects presumably associated
with interracial contact were
not found in this study. What,
then, are some factors which seem to
account for the obtained results?

Although the three-part criterion
discussed earlier in this paper was met
in this contact situation, it is possible
that only more general and intimate
contact between interracial groups
(Brophy, 1945; Deutsch & Collins,
1952; Wilner et al., 1952) results in
significant modifications of attitudes.
It should be remembered that the subjects,
while attending the integrated
school, lived in racially closed communities.
It is possible that integration
within the total social environment
is necessary, for attitudes
and values which can be acquired
in the school setting are probably
contradicted by the realities of the
restricted community.



The length of the contact situation 
in this instance could be another factor 
that partially explains the failure of 
the contact hypotheses. A 6-month period
of time could very well be inadequate
to allow for the necessary intergroup
adjustment of the subjects.
Although this period of contact was 
longer than that reported in a number 
of studies, subsequent investigation 
may show that the initial assumptions 
associated with contact experiences are 
valid. 

The social acceptance scores of both 
white and Negro subjects provide 
another possible explanation. Social 
distance (acceptance) research by 
Bogardus (1959) and others has docu- 
mented the existence of a hierarchy of 
racial preferences. These normative 
frames of reference appear to be held 
both commonly and consistently by 
members of various racial or ethnic 
groups. Traditionally, whites rate 
Negroes in their preferences low on a
social distance scale, while Negroes 
tend to rate whites quite highly. The 
data obtained in this study reflect 
these phenomena. While social dis- 
tance measures have proven to be 
both reliable and valid, it is possible 
that they are not sensitive enough to 
measure less noticeable changes within 
individual attitude structures. Meth- 
ods and instruments which assess in- 
terracial attitudes with greater depth 
and precision would seem to be needed. 

An interesting finding of the study 
was the significant association of in- 
terracial friendship choices with post- 
contact social acceptance scores but 
not with pretest scores. This finding 
seems to suggest caution in accepting 
the responses of subjects to social dis- 
tance and other types of attitude 
measures as predictors of behavior. 
Many subjects whose precontact responses
were favorable or highly accepting
of the other group did not
make eventual interracial friendships,
and in a number of cases completely 
reversed their scale scores from high 
acceptance (scores of 7-9) to high re- 
jection (0-2). 

The data obtained in this study 
seem to suggest a critical reappraisal 
of the contact hypothesis. Factors ex- 
ternal to the setting of the interaction 
as well as those associated with racial 


and ethnic group membership must be 
considered. 


Summary


This study tested three major hy- 
potheses relating to interracial con- 
tact, in a California junior high school, 
between white and Negro subjects. It 
was held that (a) a school contact 
situation of 6-months’ duration would 
lead to increased interracial social ac- 
ceptance by members of the two racial 
groups, (b) that a comparison between 
experimental subjects who experienced 
interracial contact and similar control 
subjects who did not would reveal 
greater interracial acceptance among 
the former group, and (c) that precon- 
tact social acceptance scores of the 
experimental subjects would be associ- 
ated with out-group friendship choices 
made in the contact setting. 

The test-retest design was used with 
60 white and 44 Negro subjects, and 
55 white and 53 Negro control sub- 
jects. A single administration of the 
social acceptance scale took place with 
the control subjects. All tests were 
administered by adults of the particu- 
lar subjects’ racial groups. 

Contact had a negative effect upon 
the white subjects; they became sig- 
nificantly less accepting of Negroes. 
The findings were inconclusive in the 
case of the Negro subjects, but did 


tend to indicate that change was
greater in the direction of more acceptance
of whites. Precontact social
acceptance scores were not found to
be associated with later friendship
choices. However, postcontact acceptance
scores were associated with actual
out-group friendship choices at
the .01 level of confidence.


References

Bogardus, E. Race reactions by sexes. Sociol. soc. Res., 1959, 43,
    439-441.
Brophy, I. The luxury of anti-Negro prejudice. Publ. opin. Quart.,
    1945, 9, 456-466.
Deutsch, M., & Collins, Mary E. The effects of public housing upon
    interracial attitudes. In G. Swanson, T. M. Newcomb, & E. L.
    Hartley (Eds.), Readings in social psychology. (Rev. ed.) New
    York: Holt, 1952. Pp. 582-593.
Guttman, L. The basis for scalogram analysis. In S. Stouffer, E. A.
    Suchman, P. F. Lazarsfeld, Shirley A. Star, & J. A. Clausen
    (Eds.), Measurement and prediction. Princeton: Princeton Univer.
    Press, 1950. Pp. 60-90.
Guttman, L. Principal components of scalable attitudes. In P. F.
    Lazarsfeld (Ed.), Mathematical thinking in the social sciences.
    Glencoe, Ill.: Free Press, 1954. Pp. 229-233.
Mussen, P. H. Some personality and social factors related to changes
    in children's attitudes toward Negroes. J. abnorm. soc. Psychol.,
    1950, 45, 423-441.
Neprash, J. Racial group contacts and social distance. Phylon, 1952,
    16, 207-212.
Rose, A. M. Studies in reduction of prejudice. Chicago: American
    Council on Race Relations, 1947.
Siegel, S. Nonparametric statistics for the behavioral sciences. New
    York: McGraw-Hill, 1956.
Wilner, D. M., Walkley, R. P., & Cook, S. W. Residential proximity and
    intergroup relations in public housing projects. J. soc. Issues,
    1952, 8, 45-69.
Young, D. American minority peoples. New York: Harper, 1932.


(Received April 3, 1961) 


Journal of Educational Psychology
1961, Vol. 52, No. 6, 297-302


TEACHERS' PERSONALITY RATINGS OF HIGH SCHOOL PUPILS


H. J. HALLWORTH 


Department of Education, University of Birmingham, England 


Several contributors to this journal
have pointed to the need for research
into teachers' ratings of their pupils,
a need which is equally pressing in
England as in America. The present
article reports part of such a research
which was carried out from the
Department of Education in the University
of Birmingham, England. It
therefore concerns a sample drawn
from a different educational tradition
from that studied, for example, by
Ryan (1958) and Holland (1959). The
results obtained are remarkably comparable
to theirs and, with one exception,
any differences appear to be due
entirely to differences in measuring instruments
and statistical procedures.
It is hoped that comparative data
such as this will be of interest to
American workers both in affording
confirmation of their findings and in
suggesting small differences which
may prove illuminating.


Method

Subjects

At the age of 11+ years most English children pass from primary to
secondary schools. Approximately 20% with the highest intelligence and
attainment are allocated to grammar schools; most of the remainder
pass into secondary modern schools.

Ratings were obtained for first year and fourth year boys in single-sex
schools in a region near Birmingham. Schools were selected in three
localities, each comprising a boys' grammar school and a boys'
secondary modern school within the same locality. The subjects were
therefore divided according to age and school type into four groups,
each distributed over three schools and containing approximately 112
boys.

It is commonly held that there are sex differences in certain
personality traits: boys

are assumed to be more self-assertive, and 
girls more cooperative with adults. Such 
differences have frequently been demon- 
strated in mean scores obtained on personal- 
ity inventories (e.g., Castaneda, McCandless, 
& Palermo, 1956). Groups comprising mem- 
bers of both sexes may consequently be ex- 
pected to yield spuriously high correlations 
between certain personality traits. It was 
therefore considered necessary to obtain rat- 
ings for groups consisting of pupils of like 
sex only. However, in the complete research 
of which part is reported in this article, rat- 
ings were obtained for parallel groups of 
girls and yielded results comparable to those 
given below. 
Similarly, it may be claimed that some
personality traits are more typical of one
age than of another. Ratings obtained for a 
group with a wide age range may therefore 
have spuriously high intercorrelations. It 
was therefore considered necessary to obtain 
ratings for several classes of boys all of the 
same age level, and to combine these into 
one group. This was done for two age levels 
and for two types of school in order to de- 
termine the effect of age and school type 
upon the intercorrelations obtained.


Measures 


Each class teacher rated his pupils on 12 
personality traits, using a graphic five-point 
scale. In order to avoid spuriously high in- 
tercorrelations from another source, he was 
requested to complete all ratings on one 
trait at a time. For each trait there was pro- 
vided a brief description for each of the ex- 
treme ends of the scale, and a two-word 
description of each of the five points on the 
scale. The traits were as follows: 


1. Emotional stability
2. Trustworthiness
3. Persistence
4. Cooperation with teachers
5. Cheerfulness
6. Sense of humor
7. Sociability
8. Self-assertion-submission
9. Maturity
10. Popularity
11. Confidence-anxiety
12. Spontaneity-withdrawal



When several teachers rate different chil- 
dren and the ratings on different traits are 
then intercorrelated, there is yet another 
Possible source of spurious correlation: some 
teachers give high ratings on all traits, some 
give low ratings. In the present research 
therefore, no attempt was made to use rat- 
ings made by different teachers when ob- 
taining correlations between traits. It was 
accepted that neither mean nor standard de- 
viation could be assumed to be comparable 
for ratings made by different teachers or, in- 
deed, for ratings made by one teacher on 
different traits. Teachers were therefore re- 
quested, when making ratings on any one 
trait, to take the mean of their own class as 
the mean of their rating scale. They were 
also requested to distribute the ratings ac- 
cording to the percentages of the normal 


curve, a graphic model being provided for 
this purpose. 


Analysis 


The data were analysed with the aid of an
electronic computer. A correlation matrix
was first obtained for the ratings made on
the 12 traits by each teacher for the boys in
his own class. Within each of the four groups
the correlation matrices were then combined,
using z scores, to obtain a matrix of
average intercorrelations between traits for
each group. A principal components analysis
was then performed on each of the four
combined matrices.*
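
The combination of the teachers' matrices can be sketched as follows. The sketch assumes that "z scores" refers to Fisher's z transformation of each correlation and that the classes are weighted equally; neither detail is stated in the text, and the function name and figures are hypothetical.

```python
import math


def average_correlations(matrices):
    """Average several correlation matrices through Fisher's z.

    Assumption: each off-diagonal r is transformed to z = atanh(r), the
    z values are averaged without weighting, and the mean is transformed
    back with tanh. All |r| must be below 1.
    """
    count = len(matrices)
    size = len(matrices[0])
    combined = [[1.0] * size for _ in range(size)]
    for i in range(size):
        for j in range(size):
            if i != j:
                mean_z = sum(math.atanh(m[i][j]) for m in matrices) / count
                combined[i][j] = math.tanh(mean_z)
    return combined


# Hypothetical correlations between two traits from two teachers' classes.
class_a = [[1.0, 0.2], [0.2, 1.0]]
class_b = [[1.0, 0.6], [0.6, 1.0]]
average = average_correlations([class_a, class_b])
```

Averaging in the z metric gives a slightly higher value than the arithmetic mean of the two correlations (about .42 rather than .40 in this example), since the transformation stretches the scale as r moves away from zero.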


Results


Across each of the four analyses the 
first two components were essentially 
similar and accounted for approxi- 
mately 60% of the total variance. Fur- 
ther components invariably accounted 
for less than 10% of the total variance 
and, since they had little similarity 
across the analyses, will not be con- 
sidered in this article. There is general 
agreement regarding the large error 
variance in teachers’ ratings. How- 
ever, when comparable factors are ob- 
tained from four different analyses, 


* Acknowledgements are due to W. L. B. 
Nixon, of the University of London Compu- 
ter Unit, whose Mercury computer principal 
components (analysis) program was used.
The program for the combination of correlation
matrices, using z scores, was developed
by the author in conjunction with the Com- 
puter Unit. 


this suggests a certain regularity in
the manner in which ratings are made and
justifies further examination.

The first two components for each
of the four analyses are shown in the
accompanying diagrams, Component I
being shown along the x axis and Component
II along the y axis. The percentage
of the total variance extracted
by each component is indicated.

In the diagram for first year grammar
school boys, a group of ratings
falls close to each of the main axes.
Those falling near the axis for Component
I are cheerfulness, sense of humor,
sociability, self-assertion, and
spontaneity, each with at least
64% of its variance accounted for by
this one dimension. The ratings falling
approximately along the axis for Component
II are emotional stability,
trustworthiness, persistence, cooperation
with teachers, and maturity, each
with between 38% and 66% of its variance
accounted for by this dimension.
We may name the first dimension “extraversion”
and the second “reliability
and conscientiousness.” Two other ratings,
namely popularity and confidence,
lie close together and between
these dimensions.

The diagram for fourth year grammar
school boys has the same
clusters of ratings, albeit they are in
this case less compact. A slight orthogonal
rotation would bring them onto
the main axes. Again the ratings of
confidence and popularity lie together
and between the two clusters.

The two diagrams for secondary
modern school boys appear at first
sight to show the well-known “halo
effect” which occurs when ratings on
a number of traits are made of an individual.
All ratings have a heavy
loading on Component I. For first year
boys, however, the same two clusters
may be seen, although the extraversion
traits are more widely spread. In
the analysis for fourth year boys, one

cluster is made by the traits of trustworthiness,
persistence, emotional stability,
and cooperation. The extraversion
traits are again scattered and now
include among them the ratings of confidence,
popularity, and maturity.


Fig. 1. Principal components analyses for four groups: Loadings of
twelve ratings on two components. (I = first component, II = second
component. The percentage of variance extracted by each component is
shown. GB1 = grammar school boys Year 1, GB4 = grammar school boys
Year 4, SMB1 = secondary modern school boys Year 1, SMB4 = secondary
modern school boys Year 4. 1 = emotional stability, 2 = trustworthiness,
3 = persistence, 4 = cooperation with teachers, 5 = cheerfulness,
6 = sense of humor, 7 = sociability, 8 = self-assertion-submission,
9 = maturity, 10 = popularity, 11 = confidence-anxiety,
12 = spontaneity-withdrawal.)


Discussion

Grammar and Secondary Modern School Groups

It is possible that the presence of
apparent halo effect for the secondary
modern school groups is due
to psychologically irrelevant factors in
the data or in the method of analysis.


Further research is necessary to deter- 
mine the matter, and is currently in 
progress. A consistent difference related
to school type appears, however,
to demand a psychological explanation.

There is no reason to suppose that 
the secondary modern school teachers 
had less acquaintance with their pupils 
and therefore less opportunity to differentiate
their ratings. Nor is there
any reason to assume that their ratings
were in any other way made under
less favorable conditions.

It would appear, therefore, that if 




any psychological explanation is re- 
quired of the difference between the 
two sets of analyses, it is probably to 
be found in the way in which the two 
groups of teachers “perceived” or 
thought about their pupils. 


Interpretation of the Principal Com- 
ponents 


Ryan (1958) reports high intercor- 
relations between all the traits which 
he used and concludes that “teach- 
ers rate on the basis of general im- 
pression.” This is, admittedly, a com- 
mon and reasonable conclusion. It is 
not, however, an invariable finding, as 
is shown by the above diagrams for 
grammar school boys, where little halo
effect is found. 

The interpretation of the two prin- 
cipal components for each group of 
grammar school boys is quite clear. In 
neither case is there any appreciable 
halo effect. Two dimensions, one of ex- 
traversion, one of reliability and con- 
scientiousness, account for three-fifths 
of all the variance. 

The problem arises when we con- 
sider the secondary modern school 
groups. 

If we interpret the two principal
components as they stand, we shall
label Component I “halo effect.” Component
II is, however, difficult to
name, and here lies the difficulty: It
must be described in terms of a scale
running from cooperation, trustworthiness,
persistence, and emotional
stability at one end, to sociability,
cheerfulness, spontaneity, self-assertion,
and sense of humor at the other.
Such a scale does not make good psychological
sense. Moreover, the suggestion
that the clusters which we have
named “reliability and conscientiousness”
and “extraversion” are independent
dimensions for grammar school
groups, but opposite ends of a single
scale for secondary modern school
groups, does not make good common
sense.
At this point we may note that the
factorial solution provided by the
components analysis is only one of
many possible solutions. It has already
been indicated that a small rotation is
appropriate for the analysis of the
fourth year grammar school group. It
is now suggested that the obvious solution
for the secondary modern groups
is to rotate the two components orthogonally
until the apparent halo effect
is at a minimum and we have the
two dimensions already obtained for
the grammar school groups.
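
What such an orthogonal rotation does to the loadings can be shown with a small sketch. The loadings and the 45-degree angle are invented for illustration and are not taken from the diagrams; the function name is likewise hypothetical.

```python
import math


def rotate(loadings, degrees):
    """Rotate the two reference axes by the given angle.

    Each item is a (Component I, Component II) pair of loadings. The
    rotation is orthogonal, so each trait's communality (the sum of its
    squared loadings) is unchanged.
    """
    t = math.radians(degrees)
    c, s = math.cos(t), math.sin(t)
    return [(a * c + b * s, -a * s + b * c) for a, b in loadings]


# Hypothetical traits: both load .60 on Component I ("halo"), with opposite
# signs on Component II.
unrotated = [(0.6, 0.6), (0.6, -0.6)]
rotated = rotate(unrotated, 45)
```

Before rotation both traits share a heavy loading on Component I and differ only in sign on Component II. After rotation each trait loads about .85 on one axis and zero on the other, so the apparent general factor gives way to two independent dimensions while the variance accounted for in each trait stays the same.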

That this is practicable may readily
be seen from the two diagrams. The
extraversion dimension is indeed not
as clearly defined as could be desired,
and it is possible that a still better
description would be in terms of a circumplex
model (Schaefer, 1959). The
rotation does, however, produce better
psychological meaning in the analyses
for the secondary modern groups, and
makes the analyses for all four groups
comparable.


Traits and Dimensions 


If we accept this solution, there re- 
mains to consider the difference be- 
tween the present research and certain 
American work, such as that already 
quoted, which has been interpreted in 
terms of halo effect only. The differ- 
ence is almost certainly due in part to 
different procedures employed in ob- 
taining ratings. Table 1 gives the cor- 
relation matrices for the two first year 
groups, and it is readily apparent that
the correlations are lower than those
obtained by either Ryan or Holland.

The most significant difference lies,
however, in the nature of the traits
used. Ryan, for example, used eight
traits: seriousness of purpose, industry,
initiative, influence, concern for
others, responsibility, self-control,


and personal appearance. All
appear to fall on, or close to, the
dimension of reliability and conscientiousness.
The same comment applies,
in general, to the 12 ratings used
by Holland except, possibly, popularity,
social leadership, and physical
vigor. In neither case was any typical
set of extraversion traits used. It
would therefore be impossible to obtain
two dimensions similar to those
obtained in the present study, and the
only comparable result would be the
halo effect that was actually obtained
in each case.


TABLE 1
Correlation Matrices for Twelve Teacher Ratings for Grammar School Boys
Year 1 and Secondary Modern School Boys Year 1

Trait                           1    2    3    4    5    6    7    8    9   10   11   12
1. Emotional stability         --   46   40   27  -03  -15  -08  -08   40   37    ?    ?
2. Trustworthiness             54   --   56   53  -03  -16  -16  -13   49    ?    ?    ?
3. Persistence                  ?    ?   --   57   19  -11    ?    ?    ?    ?    ?    ?
4. Cooperation with teachers   51   59   37   --   04  -16    ?  -19    ?    ?    ?    ?
5. Cheerfulness                43   11   25    ?   --   71   58   45  -18   21   37   36
6. Sense of humor             -07  -35  -11  -13   67   --   57   58  -31   21   30   55
7. Sociability                 19   00   10   23   75   56   --   66  -09   24   44    ?
8. Self-assertion-submission   08  -28   02  -08   56   42   66   --   00   30   46    ?
9. Maturity                    61   62   58   49   30  -15   15   11   --   26   27  -06
10. Popularity                 50   31   37   44   66   36   59   32   44   --   27   15
11. Confidence-anxiety         51   25   52   24   48   15   42   38   56   53   --   31
12. Spontaneity-withdrawal     21  -08   07  -05   56   49   57   59   18   42   46   --

Note: Correlations for grammar school boys Year 1 are above the
diagonal. Correlations for secondary modern school boys Year 1 are
below the diagonal.

In the present research, ratings on
the traits of popularity and confidence
tend to fall together and between the
two main dimensions. Here, possibly,
is a third dimension, and there is no
reason to suppose that yet other
dimensions could not be identified. It
is, however, apparent that they would
be of less importance both practically
and theoretically. It is equally apparent
that the dimension of reliability
and conscientiousness, already so well
identified in earlier articles in this
journal, is of greater importance to
teachers than is the dimension of
extraversion.

Ratings of Maturity 

The rating of “maturity” is of par- 

ticular interest in the present research, 
since in three analyses it is part of the 
dimension of reliability and conscien- 
tiousness, and in the analysis for 
fourth year secondary modern school 
boys it is closer to the extraversion 
traits. An interpretation of this dif- 
ference appears to demand some ac- 
quaintance with secondary education 
in England. 

Grammar schools in England are es- 
sentially academic schools and gram- 
mar school teachers value most highly 
in their pupils those traits which are 
associated with academic success. Both 
these teachers and, apparently, teach- 
ers of first year secondary modern 
school boys, regard as mature those 
pupils who are most trustworthy, per- 
sistent, emotionally stable and cooper- 
ative: in other words, the children 
who are most mature in their school
work.

Teachers of fourth year secondary
modern school boys associate maturity
in their pupils more closely with those




traits which may be immediately use- 
ful in life outside school. Very few of 
these boys could be expected to remain 
at school beyond the end of their 
fourth year. It appears that their 
teachers were already assessing their 
maturity in terms of the social qual- 
ities they would require upon leaving 
school. 

The implication is that ratings of 
maturity have a large “evaluative” 
element. It is probable that most 
teachers, as Holland found, measure 
maturity in relation to the dimension 
of reliability and conscientiousness. 
There appear to be, however, circum- 
stances in which teachers relate ma- 
turity to other traits which they con- 
sider more relevant for their pupils. 


Conclusion


The results of this research indicate 
that the extent to which trait names 
are interchangeable depends upon the 
traits with which we are dealing. If we 
refer to those traits with which teach- 
ers are commonly concerned and which 
in this article have been designated as 
reliability and conscientiousness, then 
indeed they are interchangeable as 
Ryan suggests; to measure one of 
them, such as persistence or emotional 
stability, would be effectively to meas- 
ure all. If, however, we also refer to 
such traits as sociability, then there is 
a case for maintaining that here is a 
quite different dimension. If still other 
traits were used, other dimensions 
again might appear. “Popularity” and 
“confidence” tend to be synonymous, 
and relatively independent of either of 
the two dimensions obtained in this research.
“Maturity” changes its meaning,
as if teachers always ask themselves:
“Maturity for what?”

In so far as we obtain a halo effect 
we may say that when a teacher js 
rating one of his pupils on a number of 
traits he tends to ask himself one ques- 
tion only: “How does he get on with 


H. J. HALLWORTH 


themselves not one question, but pe 
namely, “How does he get on te | 
me?” (reliability and consateano 
ness) ; and “How does he get on W! 
other boys?” (extraversion). w 

It is apparent that the ane 
these two questions alone will go he 
explain teachers’ ratings on al 
traits used in this research. be 

The question whether they ae 
described as referring to two ot e 
stereotypes will be dealt with in a 1) 
article. 


SumMary 


Teachers in six boys’ secondary schools in England rated their
pupils on 12 personality traits, and a correlation matrix was
obtained for each teacher. Matrices were combined for classes of
different age and school type into four groups. In contrast to the
usual finding of a “halo” effect, two dimensions were obtained in
each case and were identified as “reliability and
conscientiousness” and “extraversion.” It is suggested that when
rating pupils on a variety of traits, teachers may be described as
asking themselves two questions: “How does he get on with me?” and
“How does he get on with other boys?”


REFERENCES 
Castaneda, A., McCandless, B. R., & Palermo, D. S. The children’s
form of the manifest anxiety scale. Child Develpm., 1956, 27,
317-326.

Holland, J. L. Some limitations of teacher ratings as predictors
of creativity. J. educ. Psychol., 1959, 50, 219-223.


me?” or “How much do I approve of
him?” In effect, it is suggested in this
article that, if we look more closely at
ratings on a variety of traits, we may
reasonably conclude that teachers ask


Ryan, F. J. Trait ratings of high school students by teachers.
J. educ. Psychol., 1958, 49, 124-128.

Schaefer, E. S. A circumplex model for maternal behavior.
J. abnorm. soc. Psychol., 1959, 59, 226-235.


(Received April 7, 1961) 


Journal of Educational Psychology 
1961, Vol. 52, No. 6, 303-307 


CONCEPT ATTAINMENT AS A FUNCTION OF 
NUMBER OF POSITIVE INSTANCES 
PRESENTED 


EUGENE SHORE 
Pennsylvania State University 


Callantine and Warren (1955) have reported an experiment in which
it was demonstrated that superior performance on a concept
attainment problem resulted from the repeated presentation of one
or a few instances of the concept as opposed to single
presentations of a larger number of instances. However, superior
performance in this case refers to errors made during learning,
i.e., associating the concept name (nonsense syllable) to the
instances presented. Callantine and Warren point out that
performance on new instances, what they refer to as transfer, was
better in the group which received single presentations of 20
different instances.

The additional research evidence bearing on the problem of concept
attainment and variety of instances is not totally consistent.
Luborsky (1945), using an aircraft recognition task, found eight
exposures of a plane to be more effective than three exposures, but
the five additional exposures differed qualitatively from the
common three, which were diagrammatic. Reed (1946) reported more
“consistent” concepts when more instances were given, but he did
not control for the number of times each instance was presented to
each subject. And Adams (1954), using a problem based on spatial
arrangements of lights, found, somewhat surprisingly, that training
on a single kind of problem was better for transfer than training
on multiple problems. Morrisett and Hovland (1959) repeated Adams’
study with an additional variety of training and concluded that
both a variety of problems and a high level of learning are
necessary for transfer, and, presumably, concept learning to take
place.


AND


LEE SECHREST


Northwestern University


The present experiments were designed to extend the work of
Callantine and Warren, who used only a single type of concept.
Specifically, in the studies to be reported, concepts based on
nonsense syllables were chosen to represent differing levels of
difficulty and differing concept attributes. The first experiment
proved to involve possible methodological complications which were
avoided in the second.

Experiment I 


Procedure 

Sixty-four male subjects from an intro- 
ductory psychology class were assigned 
randomly to one of four experimental con- 
ditions: 

Group 3 was presented with three dif- 
ferent instances of each of four concepts, 
each instance being presented six times. 

Group 6 was presented with six different 
instances of each of four concepts, each 
instance being presented three times. 

Group 9 was presented with nine different 
instances of each of four concepts, each in- 
stance being presented two times. 

Group 18 was presented with 18 different 
instances of each of four concepts, each in- 
stance being presented only once. Thus, 
each of the four groups was presented with 
a total of 72 separate stimuli, 18 for each 
concept. The groups differed in terms of 
the number of different instances of the 
concepts which were to be discovered. 
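
The arithmetic of this design can be checked with a short sketch. The variable and function names below are illustrative only and do not come from the original report; the group sizes and repetition counts are those given in the text.

```python
# Presentation schedule for Experiment I, as described in the text.
# Key: number of different instances per concept (the group label).
# Value: number of times each instance was presented.
N_CONCEPTS = 4
schedule = {3: 6, 6: 3, 9: 2, 18: 1}


def stimuli_per_concept(n_instances: int, n_repetitions: int) -> int:
    """Total presentations devoted to a single concept."""
    return n_instances * n_repetitions


# Total stimuli seen by each group across all four concepts.
totals = {
    group: stimuli_per_concept(group, reps) * N_CONCEPTS
    for group, reps in schedule.items()
}

for group, total in totals.items():
    print(f"Group {group}: {total // N_CONCEPTS} per concept, {total} in all")
```

Every group receives 18 presentations per concept and 72 in all, so the groups differ only in how those presentations are divided between new instances and repetitions.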

The concepts selected for study were derived from common
characteristics of certain nonsense syllables. On the basis of
preliminary study, concepts at four levels of difficulty were
chosen for use in the experiment, difficulty having been defined as
the inverse of the number of test instances of the concept which
could be identified after a training series on each of the
concepts.

The nonsense syllables were obtained 
from Glaze’s lists (Hilgard, 1951), with an 
attempt being made to utilize only syllables 
of low association value. Most of the in- 
stances were chosen from the 13%, 20%, 
and 27% association value lists. Each of 
the four concepts to be utilized was as- 
signed a simple name for use in the experi- 
ment. The definitions, names, and sample 
instances used for each of the concepts are 
presented in Table 1. 

The subjects were run in small groups of 
from 6 to 10 members each. The instances 
of the concepts were printed on white 4 x 
6 index cards. They were exposed by hand 
against a neutral background at the rate 
of one every 13 seconds. The subjects were 
given 8 seconds in which to mark an ap- 
propriate answer in a booklet, were told to 


TABLE 1

DEFINITIONS, NAMES, AND SAMPLE INSTANCES OF CONCEPTS

Concept     Definition                                Instances
DASH        Middle letter A                           KAG, ZAB, XAP, PAQ
TRIANGLE    Ending in X, Y, or Z                      GOX, PIY, JEZ, KUY
DOT         Letters formed of straight lines or       rrw, KEV, ZEH, YIK
            middle letter I or E
CROSS       Letters can be rearranged to spell word   HES, XOP, YOJ, MUH

TABLE 2

MEAN NUMBER OF CORRECT IDENTIFICATIONS OF TEST INSTANCES OF EACH
OF FOUR CONCEPTS WITHIN EACH OF FOUR GROUPS

                          Concept
Group             DASH   TRIANGLE    DOT    CROSS    Total
3                 4.50     1.69     1.37    2.19      9.81
6                 4.31     2.56     1.44    1.75     10.06
9                 4.81     2.25     2.37    2.12     11.56
18                4.12     3.36     1.56    1.81     10.81
Total of means   17.75     9.81     6.81    7.87




TABLE 3

ANALYSIS OF VARIANCE FOR REPEATED MEASURES ON TOTAL CORRECT
IDENTIFICATIONS OF TEST INSTANCES

Source                   df      SS       MS        F
Between subjects         63    120.44
  Between instances       3     13.50     4.50     2.52
  Error (between)        60    106.94     1.78
Within subjects         192    577.50
  Between concepts        3    204.03    68.01    35.79**
  Interaction             9     30.78     3.33
  Error (within)        180    342.69     1.90

** p < .01.


look up, and at 10 seconds were told the correct answer for the
instance. The instance was exposed for 3 seconds following the
reading of the correct answer. In the use of a correction procedure
this experiment follows Callantine and Warren (1955).

After the presentation of the first 36 instances and again at the
end of the presentation of the entire series of 72 instances, a
blank card was exposed and subjects were asked to try to write the
definition of each of the concepts.

Following the presentation of the series of 72 training instances
of the concepts and the attempt to write the definitions of the
concepts, the subjects were shown 20 additional new instances, five
for each concept, and were told, “This time you will not be told
the correct names of the examples. Remember, there is no penalty
for guessing.” The scores for each subject were the number of
correct responses to each of the five test instances of the four
concepts.


Results 


In Table 2 are presented means for 
the scores based on responses to the 
test instances. In all groups and for all 
concepts, identification scores exceed 
chance, albeit by very little in some 
cases. Analysis of variance (Table 3) 
indicated that only the difference in 
difficulty of the four concepts was significant. However, it became
apparent that the four groups differed not only in the number of
available positive instances of each concept, but in the number of
negative instances as well. Since a positive instance of DOT is
also a negative instance of the other three concepts, the four
groups were quite unequal with respect to negative instances. The
possibility exists that an increase in negative as well as positive
instances could increase cognitive strain to such an extent that
information available from positive instances would be lost. A
second experiment was designed to equate groups for negative
instances while varying positive instances.

Experiment II 
Procedure 


Instances for concepts DASH, TRIANGLE, and CROSS were employed to
construct new lists for three experimental conditions. In
Condition 3 the subjects were exposed to three instances of one
concept, each repeated six times, and six instances of each of the
other two concepts, each instance repeated three times. For
Condition 9 subjects were exposed to nine instances of one concept,
each repeated twice, and six of each of the others. And, in
Condition 18, subjects saw 18 instances of one concept, each only
once, and again six of each of the others with three repetitions.
All lists were randomized save for restrictions against repetition
of an instance within a brief series.
Since the design was factorial with respect
to number of instances of the concept, 3, 9,
or 18, and concept difficulty, DASH, TRIANGLE,
ups, each 


ntal pro- 


. Following presentation 
Subjects were asked for 


Correct responses for 
Concept during # 


Results 


The means for number of correct identifications of the
experimentally manipulated concept for each of the nine subgroups
are presented in Table 4. Again the means indicate that performance
was better than chance in all groups, by a somewhat greater margin
than in Experiment I.

Analysis of variance, which is summarized in Table 5, yielded once
more a highly significant effect attributable to differences
between the concepts although, as reference to Table 4 will show,
the principal difference is obviously the lower difficulty of DASH.
As Table 4 also indicates, TRIANGLE and CROSS are reversed in
difficulty from Experiment I.

Of greater interest is the significant 
interaction between number of instan- 
ces and concept. Inspection of the 
means would seem to indicate that for 
some reason nine instances, each re- 
peated twice are less effective for DASH, 


TABLE 4

MEAN NUMBER OF CORRECT IDENTIFICATIONS OF TEST INSTANCES OF THREE
CONCEPTS IN EXPERIMENT II

                       Concept
Condition     DASH    TRIANGLE    CROSS    Total
3             7.86      3.57      6.27     17.70
9             6.14      6.57      4.57     17.28
18            8.00      4.29      4.71     17.00
Total        22.00     14.43     15.55

TABLE 5

ANALYSIS OF VARIANCE OF THE NUMBER OF CORRECT IDENTIFICATIONS OF
TEST INSTANCES IN EXPERIMENT II

Source                    df      SS       MS        F
Number of instances        2      0.60     0.30     0.08
Concepts                   2     77.74    38.87    10.62**
Instances X Concepts       4     61.41    15.35     4.19*
Error                     54    198.06     3.66
Total                     62    337.81

* p < .05.
** p < .01.




the very easy concept, than either fewer or more instances, that
the very opposite is true for TRIANGLE, and that for CROSS a few
instances repeated several times are better. The results from
Experiment I would support the contention that TRIANGLE is not
easily solved with very few repeated instances.
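
The F ratios in Table 5 can be recomputed from the sums of squares and degrees of freedom. The sketch below is illustrative only: the names are not from the original report, and the interaction sum of squares (61.41) is the value implied by the printed total. Because the mean squares here are not rounded before division, a ratio may differ from the printed figure in the second decimal place.

```python
# Sums of squares and degrees of freedom as read from Table 5.
table_5 = {
    "Number of instances": {"ss": 0.60, "df": 2},
    "Concepts": {"ss": 77.74, "df": 2},
    "Instances x Concepts": {"ss": 61.41, "df": 4},
    "Error": {"ss": 198.06, "df": 54},
}


def mean_square(ss: float, df: int) -> float:
    """Mean square = sum of squares divided by its degrees of freedom."""
    return ss / df


ms_error = mean_square(**table_5["Error"])

# Each effect is tested against the single error term of the factorial design.
f_ratios = {
    source: mean_square(**row) / ms_error
    for source, row in table_5.items()
    if source != "Error"
}

for source, f in f_ratios.items():
    print(f"{source}: F = {f:.2f}")
```

Only the concept effect and the interaction yield ratios well above 1, which is the pattern reported in the text.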


Discussion 


Obviously the answer to the question of optimum number and
frequency of instances to be presented in a concept attainment task
is still obscure. The single interaction found pointed to the
probable complexity of the answer, which will very likely be shown
to be a function of rather specific characteristics of the concepts
involved.

In Experiment II, for example, the three concepts differed, not
only in difficulty, but in the characteristics of the instances on
which they were based. DASH is based on the rather obvious
characteristic of nonsense syllables, the particular letters from
which they are formed. TRIANGLE is based upon characteristics of
the letters themselves, and CROSS requires a mental manipulation or
restructuring of the stimuli presented. It may well require the
repetition of an instance on several occasions in order for most
subjects to eliminate the alternatives more obvious than syllables
with straight letters or rearranged words.

One difficulty which still existed in the procedure employed was
that the contexts were not the same for all concepts. Thus, when
TRIANGLE was being experimentally treated, it was being presented
in the context of a difficult and an easy concept. DASH, by
contrast, was always presented in the context of two more difficult
concepts. Future work should make use of a common context of
instances for all experimentally treated concepts.




It is believed that the discrepancy between our results and those
of Callantine and Warren (1955) may result both from the different
materials used and the fact that our experiments were adapted to
group procedures. Further experiments are in progress which should
throw some light on questions raised above.


SUMMARY 


Two experiments were carried out in an attempt to determine whether
concepts are more easily discovered from the repeated examinations
of a few instances of the concept or from single examinations of
larger numbers of instances. Additionally, an attempt was made to
vary the difficulty level of the concepts studied. The materials
used consisted of concepts based upon nonsense syllables. In the
first experiment there were no significant effects attributable to
the number of different instances presented, but the conditions
were thought to be confounded with varying numbers of negative
instances.

A second experiment was performed in which number of instances and
type of concept were varied in a factorial design with number of
negative instances being constant across conditions. Employing
recognition of test instances as a measure, a significant
interaction between number of instances and specific concept was
found. Although not unequivocally interpretable, the results
appeared to indicate that when a concept is based upon less obvious
characteristics of instances or requires some mental transformation
of stimuli, repetition of instances is desirable. Repetition may be
less necessary with relatively obvious concepts.

REFERENCES 


Adams, J. A. Multiple versus single problem training in human
problem solving. J. exp. Psychol., 1954, 48, 15-18.

Callantine, Mary F., & Warren, J. M. Learning sets in human
concept formation. Psychol. Rep., 1955, 1, 363-367.

Hilgard, E. R. Methods and procedures in the study of learning. In
S. S. Stevens (Ed.), Handbook of experimental psychology. New
York: Wiley, 1951. Pp. 517-567.

Luborsky, L. I. Aircraft recognition: I. The relative efficiency
of teaching procedures. J. appl. Psychol., 1945, 29, 385-

Morrisett, L., Jr., & Hovland, C. I. A comparison of three
varieties of training in human problem solving. J. exp. Psychol.,
1959, 58, 52-55.

Reed, H. B. Factors influencing the learning and retention of
concepts: III. The origin of concepts. J. exp. Psychol., 1946,
36, 166-179.


(Received May 31, 1961) 


Journal of Educational Psychology
1961, Vol. 52, No. 6, 308-316 


THE ENVIRONMENTAL ASSESSMENT TECHNIQUE: 
A WAY TO MEASURE COLLEGE ENVIRONMENTS¹


ALEXANDER W. ASTIN AND JOHN L. HOLLAND
National Merit Scholarship Corporation 


The purpose of this report is to de- 
scribe a method for measuring the col- 
lege environment, and to present some 
data on its validity and reliability. 
Even though it is axiomatic that, to 
predict behavior efficiently, we must 
assess the environment as well as the 
person, few techniques are available 
for assessing human environments.
Barker and Wright (1954) have de- 
scribed an elaborate procedure for re- 
cording the child’s experiences in his 
community. For assessing a more re- 
stricted type of community—the col- 
lege environment or press—Pace and 
Stern (1958) have developed the Col- 
lege Characteristics Index (CCI), a 
true-false inventory by which students 
can describe the characteristics of the 
student body, faculty, and administra- 
tion of the college. More recently, 
Lazarsfeld (1959) and Dailey (1960) 
have made suggestions for research
which may produce other methods to
assess the environment.


RATIONALE 


The Environmental Assessment 
Technique (EAT) is based on the no- 
tion, suggested by Linton (1945) and 
others, that a major portion of en- 
vironmental forces is transmitted 
through other people. We can infer 
from this that the character of a social 
environment is dependent upon the 
nature of its members. Moreover, the 
dominant features of an environment 
are dependent upon the typical char- 
acteristics of its members. If, then, we 


1 This study is a part of the research pro- 
gram of the National Merit Scholarship 
Corporation and was supported by a grant 
from the National Science Foundation. 


know the character of the people in a
group, we should know the climate
that group creates.

As a first step, we have assumed that the college environment or
“press” is a product of the following attributes of the student
body: the total number of students in the college, the average
intelligence of the students, and six personal characteristics of
the student body (as estimated by a typology of six types). These
eight variables are assumed to encompass an extensive range of the
student’s attributes, including his personality, interests, values,
originality, self-concept, parental background, goals, and
aptitudes. Assessing colleges by means of the EAT should yield
environmental descriptions which will enable us to study the
interactions of students and colleges in terms of a variety of
educational and personal outcomes.


ENVIRONMENTAL MEASURES


Explicit definitions of the eight EAT 
variables are given in the following sections. 

Institutional Size. The size of the student 
body (N) is simply the total enrollment at
the institution. Since the distribution of col- 
lege enrollments tends to have a markedly 
Positive skew, the square root of N was 
chosen as the actual measure of size. This 
transformation simplifies computations by 
reducing the number of digits and makes 
the distribution of college size more nearly 
normal. 
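
The effect of the square root transformation can be illustrated with a small sketch. The enrollment figures below are hypothetical, chosen only to show a positively skewed distribution; the function name and the use of the moment coefficient of skewness are likewise illustrative and not part of the original method.

```python
import math

# Hypothetical enrollments (not data from the study): a few very large
# institutions produce a long right tail.
enrollments = [400, 900, 1600, 2500, 10000, 22500]

# The EAT size measure is the square root of total enrollment N.
size_scores = [math.sqrt(n) for n in enrollments]


def skewness(values):
    """Moment coefficient of skewness: m3 / m2 ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


print("size scores:", size_scores)
print("skewness of N:      ", round(skewness(enrollments), 2))
print("skewness of sqrt(N):", round(skewness(size_scores), 2))
```

For these illustrative figures the transformed scores have fewer digits and a smaller positive skew than the raw enrollments, which is the purpose the text gives for the transformation.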

Intelligence Level of the Student Body. To secure intelligence
estimates of the student bodies, mean scores on the National Merit
Scholarship Qualifying Test (NMSQT) were obtained for a sample of
students entering undergraduate institutions. These students came
from a 10% national sample of high school juniors scoring above the
sixty-fifth percentile on the 1959 NMSQT. Only the scores of
students




who actually entered college were used. Institutions attended by
less than 15 of these students were excluded, leaving a total of
335 degree-granting institutions.

The following check was made on the representativeness of
differences between mean intelligence levels in the smaller samples
(Ns between 15 and 24): mean scores on the CEEB Scholastic Aptitude
Test, which were available for the entire 1956 freshman class
(Fishman, 1957) at 37 of the 65 institutions enrolling less than 25
of these students, were correlated with the corresponding mean
NMSQT scores. The Pearson coefficient of .92 indicated satisfactory
representativeness, particularly in view of the fact that different
tests, given in different years, were being compared.
Characteristics of the Student Body. The method of personality
assessment used in EAT is based on Holland’s (1959b, 1961) theory
of vocational choice. A central hypothesis of this theory is that
persons in different vocations have different personalities.
Holland has proposed a classification of vocations in terms of six
personality types: Realistic, Intellectual, Social, Conventional,
Enterprising, and Artistic. For each of these types there is a
narrative summary of personal characteristics called the “model
personal orientation.” By identifying the class to which any
vocation belongs, we can use a person’s vocational choice as a
miniature personality “test.” For example, if a person’s choice is
engineering, which falls in the Realistic class, we would expect
him to possess some of the characteristics of the model Realistic
orientation: masculine, physically strong, unsociable, aggressive,
etc. The validities of these personal orientations have been
examined in several recent studies (Holland, 1959a, 1960, 1961).
According to this theory, an environment can be described in terms
of the orientations (personalities) of its members. To take a
simple example: We would expect the environment or “climate” of a
group of engineers to differ from that of a group of lawyers to the
extent that we expect engineers and lawyers to be different kinds
of people. If we extend this notion to the college, it should be
possible to characterize the college environment in terms of the
major fields of its students.

As a first step in assessing the 335 environments, each college
major was classified as belonging in one of Holland’s six classes.
The assignment of major fields to classes was made on the basis of
the original formulations in the theory and the keys (six lists of
occupational titles) in the Holland Vocational Preference Inventory
(Holland, 1959a). Cases of doubt were resolved in discussion by the
authors.

Descriptions of the model personal orien- 
tations and lists of the major fields included 
in each orientation are given in Table 1. 
These descriptions represent hypotheses 
about the climate and “press characteristics” 
associated with each type. Information about 
the number of students at a given institution 
who obtained a baccalaureate degree in a
specific major field was secured from the
publication, Earned Degrees Conferred by 
Higher Educational Institutions: 1957-1958 
(United States Office of Education, 1959). 
Unclassifiable degrees, representing about 
32% of the total, were omitted from our 
analysis. 

To obtain measures of the six personal orientations at any
institution, the number of majors (i.e., students in a major field)
of each orientation was expressed as a percentage of the total
number of classifiable majors. For instance, the Realistic
orientation of a college environment would be:

    Realistic orientation =

        Number of majors classified Realistic X 100
        -------------------------------------------
            Total number of classifiable majors

In this way, each institution was characterized by the proportion
of its students belonging to each of the six personal orientations,
as determined by their choice of major field.
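
The computation of the six orientation scores can be sketched as follows. The function name and the degree counts are hypothetical and serve only to illustrate the percentage described above; unclassifiable degrees are assumed to have been dropped before the counts are supplied.

```python
ORIENTATIONS = (
    "Realistic", "Intellectual", "Social",
    "Conventional", "Enterprising", "Artistic",
)


def orientation_percentages(majors_by_orientation):
    """Express each orientation's majors as a percentage of all
    classifiable majors at one institution."""
    total = sum(majors_by_orientation.get(o, 0) for o in ORIENTATIONS)
    if total == 0:
        raise ValueError("institution has no classifiable majors")
    return {
        o: 100 * majors_by_orientation.get(o, 0) / total
        for o in ORIENTATIONS
    }


# Hypothetical baccalaureate counts for a single institution.
example_counts = {
    "Realistic": 120, "Intellectual": 80, "Social": 100,
    "Conventional": 40, "Enterprising": 40, "Artistic": 20,
}

profile = orientation_percentages(example_counts)
print(profile)
```

Because the six percentages share one denominator they sum to 100, so a high score on one orientation necessarily lowers the others.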

VALIDITY OF THE EAT


An initial test of the EAT’s validity 
was made using the College Charac- 
teristics Index. In the CCI, observers 
(usually students) at the institution 
describe the college in terms of 300 
dichotomous items. These items are 
scored on 30 press scales of 10 items 
each. Some of these CCI data, which
were obtained in 1958 from Merit
Scholars and Finalists at 36 institutions,²
have been used in two previous



? Amherst, Brown, Cali i 

alifornia i 
st Technology, Carleton, Carnegie, See 
a 7 2 ae Colorado, Cornell Dart. 
5 e, Georgia Institut ; ‘i 
Boley, Harvard, Indiana, Iowa a — 
, Massachusetts Institute of Technology, 
Ys 




TABLE 1

COLLEGE MAJOR FIELDS CORRESPONDING TO EACH OF SIX PERSONAL
ORIENTATIONS

Realistic
    Description (from Holland, 1961): “masculine, physically
    strong, unsociable, aggressive . . . prefers concrete to
    abstract”
    Relevant major fields: agriculture, agricultural education,
    physical education, recreation, industrial arts, engineering,
    forestry, trade, and industry

Intellectual
    Description (from Holland, 1961): “task-oriented,
    intraceptive, asocial, prefers to think through rather than act
    out; needs to understand”
    Relevant major fields: architecture, biological sciences,
    geography, medical technology, pharmacy, mathematics,
    philosophy, physical sciences, anthropology

Social
    Description (from Holland, 1961): “sociable, responsible,
    feminine . . . needs attention . . . avoids intellectual
    problem-solving . . . orally dependent”
    Relevant major fields: health education, education of
    exceptional children and mentally retarded, speech correction,
    education (unclass.), nursing, occupational therapy, physical
    therapy, scholastic philosophy, social science (general),
    American civilization, sociology, social work

Conventional
    Description (from Holland, 1961): “prefers structured
    numerical and verbal activities and subordinate roles . . .
    conforming . . . identifies with power, externals, and status”
    Relevant major fields: accounting, secretarial, business and
    commercial (general and unclass.), business education, library
    science, economics

Enterprising
    Description (from Holland, 1961): “verbal skills for
    dominating, selling, leading others . . . orally aggressive”
    Relevant major fields: hotel and restaurant administration,
    hospital administration, history, international relations,
    political science, foreign service, industrial relations,
    public administration

Artistic
    Description (from Holland, 1961): “asocial; avoids problems
    which are highly structured or require gross physical skills
    . . . intraceptive . . . need for individualistic expression”
    Relevant major fields: art education, music education, English
    and journalism, fine and applied arts (all fields), foreign
    language and literature (all fields)



studies (Astin, 1961; Thistlethwaite, 
1959). Each of the eight EAT meas- 
ures (√N, Intelligence, and the six
personal orientations) was correlated 
with the 30 CCI scales (Table 2). Of 
the 240 coefficients in Table 2, 23% 
are significant at the .01 level, and 
39% at the .05 level. 

The CCI scales most highly related 

Fog innesota, Northwes: ern, Notr 
ee ee Pennsylvania, Baaiony, 
Princeton, Purdue, Radcliffe, Rensselaer 
Polytechnic Institute, Rice, Smith, Stenford, 


iversi f Texas, Wiscon. 
arthmore, University o: . 
pgs Wesleyan, and Yale. 


to institutional size are Aggression (.64), Achievement (−.59),
Counteraction-Infavoidance (−.58), Understanding (−.58), Passivity
(.55), Fantasied Achievement (−.55), Deference (.54), Sex (.54),
Exhibition (.53), and Pragmatism (.52). These results suggest that
in the larger as opposed to the smaller student bodies there is
more aggressive behavior, exhibitionism, and heterosexual activity,
and more deference shown toward faculty.
In smaller student bodies, on the other 
hand, there is more academic competi- 




TABLE 2

THE RELATION OF EIGHT ENVIRONMENTAL MEASURES TO THE COLLEGE
CHARACTERISTICS INDEX AT 36 INSTITUTIONS

Columns, left to right: CCI scale; size of student body;
intelligence of student body; personal orientations (a):
Realistic, Intellectual, Social, Conventional, Enterprising,
Artistic.

Abasement 20 | —37 | 38|—-2%9| of] 1] 4) 8 
Achievement —59 gt | —-20| 33|-22|-35| 42] 35 
Adaptiveness a) ose | Mie] By Ble 
filiation-Rejection —35 og |—21| 38| 01-07] 05) 16 
&gression-Blamavoidance 64 09 45 09 | —25 0s | —38 = 
ange-Sameness —06 7 | 20| 14] -14| 02 | —10 | — 
Conjunctivity-Disjunctivity -19 06 | —06 16 | —07 | —24 22 = 
ounteraction-Infayoidance —53 49 os | 28 | —41| -29] 2 
Sference | —63 | 26| 55] 20| 30) 38 | —26 
ominance -19 a1 | —21| 32] 08|—02] 03 | 09 
Ego Achievement “w@ | -19 | —a4}—18| 38] 12) 18) 4 
motionality-Placidity —33 23 —25 21 | —14 18 20 = 
nuetgy-Passivity —55 a3 | —05| s1| 35-42) S| 
Exhibition-Infavoidance 53 | —45 | 14] -33] 43] 36 - 2 
Fantasied Achievement —55 49 |—32| 46] -17|-2| 35] 3% 
armavoidance —30 96 | —50| of] a] —-14| 55 
umanism —28 55 |—si| 23| 25| ov) 79] 6 
Mmpulsion-Deliberation 27 00 il 28 | —08 23 | —28 | — A 
arcissism | 39 |—i7|—43| Bo] ch] OL) FF 
urturance-Rejection _o3 | —26 | -37| —18| 39] -08| 22 46 
bjectivity —42 6s | —26| 46| 24] 26) 40 
rdex oe | —51 | 05] —31| 36) 15) —10) —08 
Eley | <e0 | 25] 24] 85] of) 98) © 
Pragmatism se | —e7 | 73] —32| —08| 02 | —73 | —88 
eflectiveness —29 50 |—-e2| 16) 14] OL) G4) Be 
cientism —18 ai | 38| 38 | —48| —08 | —22) —ee 
entience —28 47-70) 10| 9t)—18| 7 
Sex. _io | ~22| —24] 93] 15] a1] 98 
a . “os |—09| 10] 14] 24| —14 —13 
rance-Autonom, = = i A 3 
nderstanding y —58 70 | —28| 46] -23|—-24| 4 


Note. r.05 = .33, r.01 = .43. Decimal points omitted in table.
a The percentage of college majors falling in each class.


tion, more striving for intellectual achievement, and greater
involvement in campus activities (Counteraction-Infavoidance,
Passivity).

Intelligence level of the student body had 18 significant
correlations with CCI scales, more than any of the other seven EAT
variables. The highest correlates of intelligence were
Understanding (.70), Objectivity (.68), Pragmatism (−.67),
Achievement (.64), Deference (−.63), Play (−.60), Adaptiveness
(−.55), Humanism (.55), Order (−.51), and Reflectiveness (.50).
Most of these press characteristics are those we would expect to
find in the more intelligent student bodies: an emphasis on
achievement, intellectual objectivity, seriousness of purpose, and
a need to analyze and understand, rather than to order and
structure.

The CCI variables most closely re- 
lated to the first personal orientation, 
Realistic, were Humanism (—.81), 
Pragmatism (.73), Sentience (—.70), 
Reflectiveness (—.62), and Harm- 
avoidance (—.50). These correlations 




imply a preference for the practical, 
the concrete rather than the abstract, 
and an aversion to feeling experience— 
traits which are consistent with the 
descriptions of the Realistic orientation
(Table 1) developed earlier (Holland, 1959b).

The Intellectual orientation had its 
highest correlations with Deference 
(—.55), Fantasied Achievement (.46), 
Objectivity (.46), and Understanding 
(.46). These results imply that college 
environments with high Intellectual 
orientations have many of the ex- 
pected characteristics: independence 
of thought and action, great need to 
achieve, and a need to analyze and 
understand. 

The CCI scales correlating highest with the Social orientation were
Narcissism (.59), Sex (.53), Exhibition-Infavoidance (.43), and
Scientism (−.43). Here there is a close correspondence with the
hypothetical description (Holland, 1961): “sociable, responsible,
feminine . . . needs attention . . . avoids intellectual
problem-solving . . . orally dependent” (Table 1).

The three highest correlates of the Conventional orientation,
Passivity (.42), Exhibition-Infavoidance (.36), and Achievement
(−.35), while of only borderline significance, are also generally
consistent with the description given in Table 1.

The Enterprising and Artistic orientations are similar in the
patterns of their relationships with the CCI scales: Humanism (.79
with Enterprising and .64 with Artistic), Pragmatism (−.73 and
−.66), Sentience (.71 and .69), Reflectiveness (.64 and .54),
Harmavoidance (.55 and .56), and Blamavoidance (.38 and .58).
Perhaps one of the reasons that these two orientations turned out
to be so similar, aside from the fact that they are known to be
related (Holland, 1959a, p. 29), is that many of the students
majoring in “business and commercial,” all of whom were included in
the Conventional orientation, should more properly be included in
the Enterprising orientation, particularly those who plan careers
in sales, promotional, and related areas.

The characteristics of the Enterprising and Artistic orientations
are almost the opposite of those associated with the Realistic
orientation. While this is again consistent with previous
descriptions (Holland, 1959a), there is the suggestion that these
three orientations may be somewhat redundant.

Another partial test of EAT’s validity was attempted using data
reported in the book, What College Students Think (Goldsen,
Rosenberg, Williams, & Suchman, 1960). These investigators
attempted to assess and compare the values, attitudes, and opinions
of students at the following institutions: Cornell, Dartmouth,
Fisk, Harvard, Michigan, North Carolina, Texas, UCLA, Wayne,
Wesleyan, and Yale. Even though the number of institutions was
quite small, it seemed likely that the environments of these
colleges would differ markedly on the variable of Intelligence.
Accordingly, 16 of the questions asked of the student bodies were
selected on the basis of their hypothesized relation to
Intelligence. These questions dealt with: the amount of cheating
admitted to by the students; preference for Republican as opposed
to Democratic party; belief in God; importance of religion;
attitudes toward labor laws, the welfare state, minimum wage, and a
free college education; importance in college of learning values;
importance of athletics; belief in the existence of academic
freedom; “tendency to personify the fraternity”; and the relative
importance of several possible goals of a college education:
general education, vocational training, training in how to get
along with people, preparation for a happy marriage.

as Int ri iabeceapna were ranked (1-11) 
NMSQT ee using the sample mean 
to 203) ; scores (Ns ranged from 15 
ai et they were also ranked (1-11) 
eh : of the 16 questions on the 
givin 0 the percentage of students 
iffere Positive responses, So that rank- 
puted :> correlations could be com- 
ilfigerse even of these rho’s were sig- 
Pariah, at the 05. level. The relevant 
ntelli es and their correlations with 
gence were: 


Item                                                          rho

A college ought to emphasize “[...] for a happy
  marriage and family life”                                [illegible]
“Some teachers are afraid to say what they really
  believe these days”                                        −.84
A college ought to “provide vocational training,
  develop skills and techniques directly applicable
  to your career”                                          [illegible]
Religion is a “major source of satisfaction”                 −.80
A college ought to emphasize “a basic general
  education and appreciation of ideas”                        .75
Republican political party preference                      [illegible]
Belief in God                                                −.69
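
The rank-difference (Spearman) correlation used above can be sketched in a few lines. This is only an illustration of the general technique, not the authors' computation: the function names and all eleven pairs of numbers are made up for the example, and the shortcut formula shown is exact only when there are no tied ranks.

```python
# Hypothetical illustration: none of these numbers come from the article.

def ranks(values):
    """Return ranks 1..n (1 = largest value); tied values share the mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    result = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # average of positions start+1 .. end+1
        for k in range(start, end + 1):
            result[order[k]] = mean_rank
        start = end + 1
    return result


def rank_difference_rho(x, y):
    """Spearman's rho as 1 - 6*sum(d^2) / (n*(n^2 - 1)); exact only without ties."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least 2 entries")
    rx, ry = ranks(x), ranks(y)
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Made-up mean ability scores and made-up percentages of positive responses
# for 11 unnamed institutions.
mean_scores = [148, 141, 139, 133, 130, 127, 125, 120, 118, 112, 109]
pct_positive = [22, 35, 30, 41, 38, 52, 47, 55, 60, 58, 66]

rho = rank_difference_rho(mean_scores, pct_positive)
print(f"rho = {rho:.2f}")
```

With only 11 institutions a rho must be quite large in absolute value to reach the .05 level, which fits the sizes of the coefficients listed above.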
On an exploratory basis, 14 more rank correlations were computed between
the other seven EAT measures and two of the items which had not
correlated significantly with Intelligence. Two more significant rho’s
were obtained: the Conventional orientation was positively related (.73)
to the “tendency to personify the fraternity,” and frequency of
admitting cheating was negatively [...].

[...] characterization of [...] institutions [...] EAT, the three [...]
(of 335) highest on each orientation were [...] (Table 3). The
institutions with highest Realistic orientations are technological
institutions primarily or exclusively for men; in contrast, those
highest on the Artistic orientation are small liberal arts colleges for
women. The highest scorers on the Social orientation include two
coeducational teachers’ colleges and a liberal arts college for women.
Those scoring highest on the Conventional orientation are all large
universities controlled by the Roman Catholic Church. (In fact, seven of
the eight institutions ranking highest on Conventional are Roman
Catholic institutions.)


TABLE 3

HIGHEST SCORING OF 335 INSTITUTIONS
ON EACH PERSONAL ORIENTATION

Realistic
    Worcester Polytechnic Institute
    Michigan College of Mining and Technology
    Case Institute of Technology

Intellectual
    Union College and University
    California Institute of Technology
    Reed College

Social
    State Teachers College at Towson (Maryland)
    Danbury State College (Connecticut)
    Georgia State College for Women

Conventional
    Seton Hall University
    Xavier University (Ohio)
    University of Scranton

Enterprising
    Georgetown University
    John Carroll University
    Wabash College

Artistic
    College of New Rochelle
    Emmanuel College (New York)
    Hollins College

All three of the institutions highest on the Intellectual orientation
are noted for their strong emphasis on scholarship and research. The
California Institute of Technology, despite its name, is classified as a
“college, graduate school, and institute of research in science,
engineering, and the humanities” (Irwin, 1960, p. 186).


TABLE 4

TEST-RETEST RELIABILITIES OF THE SIX ENVIRONMENTAL PERSONAL
ORIENTATIONS FOR INTERVALS OF 1, 3, AND 6 YEARS (31 INSTITUTIONS)

                                   Reliability
Orientation        1 Year       3 Years      6 Years
                  (1955-56)    (1955-58)    (1952-58)
Realistic            .99          .98          .97
Intellectual         .81          .55       [illegible]
Social               .88          .87          .80
Conventional         .94          .93          .93
Enterprising         .96          .84          .83
Artistic             .95          .96          .93

Two of the three highest scorers on 
Enterprising, Georgetown University 
and Wabash College, are of particular 
interest in view of the hypothesized 
correlates of that orientation: “verbal 
skills for dominating, selling, leading 
others... orally aggressive.” George- 
town University is noted for its large 
Law School and its School of Foreign 
Service. Wabash College, too, with its 
Personal Development Program for 
“young executives,” its Institute of 
Politics, and its Washington Semester 
Program for “students of demon- 
strated ability and interest in public 
affairs” (Irwin, 1960, pp. 391-392), 
appears to place considerable emphasis on “enterprising” activities.


RELIABILITY OF THE PERSONAL
ORIENTATIONS


A check on the test-retest reliability of the six measures of personal
orientation was made using a 10% random sample (N = 31) of institutions
from the original sample of 335.³ Scores on the six personal
orientations for these 31 colleges were obtained for the graduation
years of 1952, 1955, 1956, and 1958 by collating the major fields listed
in Earned Degrees Conferred by Higher Educational Institutions:
1957-1958 (United States Office of Education, 1959) for those years.
Table 4 shows retest reliabilities over intervals of 1, 3, and 6 years.
With the possible exception of the Intellectual orientation, all
measures appear to be highly reliable over the 1-year interval. Even
over the 6-year period, all orientations except the Intellectual seem to
have satisfactorily high reliabilities. (Table 4 also shows that
differences among institutions in their curricula are highly stable from
year to year.)

³The entire sample of colleges was not used because of the prohibitive
amount of clerical labor involved in collating major fields from Earned
Degrees Conferred by Higher Educational Institutions: 1957-1958 (United
States Office of Education, 1959) by college.
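
A retest reliability of this kind is ordinarily a product-moment correlation between the scores the same institutions receive in two different years. The sketch below assumes that reading; the source does not spell out the computation, the function name is illustrative, and the percentages are invented for eight unnamed colleges.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists of scores."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 entries")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Made-up percentages of graduates in "Realistic" major fields at eight
# hypothetical colleges in two consecutive graduation years.
realistic_1955 = [42.0, 5.5, 18.0, 60.2, 11.3, 27.9, 3.1, 35.4]
realistic_1956 = [40.8, 6.1, 19.5, 58.7, 10.2, 29.3, 4.0, 33.9]

retest_r = pearson_r(realistic_1955, realistic_1956)
print(f"1-year retest r = {retest_r:.2f}")
```

Because curricula change slowly, the percentage of degrees in each class of major fields moves little from one year to the next, which is why coefficients in the .90s are plausible for short intervals.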


INTERCORRELATIONS OF EAT VARIABLES


Intercorrelations of the eight EAT variables are shown in Table 5. While
all but four of the 28 correlations are significant (p < .05), the
highest correlation is only −.57 (between the Realistic and Artistic
orientations). An inspection of the larger correlations reveals two
major clusters: the first combines Intelligence, the Intellectual
orientation, and the Enterprising orientation; the second, involving the
Realistic (negatively), Enterprising, and Artistic orientations, was to
be expected from the patterns of CCI correlations (Table 2) discussed
previously.

It should be noted that the six personal orientations are self-limiting
(i.e., for any college the six scores must sum to 100). Such dependence
among these measures accounts at least partially for the fact that 12 of
the 15 correlations among the personal orientations were in a negative
direction.
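
The effect of the sum-to-100 constraint can be illustrated with a small simulation. This is a sketch under stated assumptions, not a model of the actual colleges: six positive quantities are drawn independently from the same distribution (an arbitrary choice made here) and rescaled to percentages. For this symmetric setup the expected correlation between any two of k percentage scores is −1/(k − 1), or −.20 when k = 6, even though the raw quantities are unrelated.

```python
import math
import random


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate_percentages(n_colleges=2000, n_orientations=6, seed=0):
    """Draw independent positive values and rescale each row to sum to 100."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n_colleges):
        raw = [rng.gammavariate(1.0, 1.0) for _ in range(n_orientations)]
        total = sum(raw)
        rows.append([100.0 * value / total for value in raw])
    return rows


rows = simulate_percentages()
columns = list(zip(*rows))
pairwise = [
    pearson_r(columns[i], columns[j])
    for i in range(len(columns))
    for j in range(i + 1, len(columns))
]
mean_r = sum(pairwise) / len(pairwise)
negative = sum(1 for r in pairwise if r < 0)
print(f"{negative} of {len(pairwise)} correlations negative; mean r = {mean_r:.2f}")
```

Against such a baseline, the positive correlations in Table 5 (for example, between the Enterprising and Artistic orientations) are the more noteworthy, since the constraint alone pushes the correlations among the six scores toward negative values.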
TABLE 5

INTERCORRELATIONS OF EAT VARIABLES
(N = 335)

Variable                         2      3      4      5      6      7      8
1. Size                        −.09    .26   −.28   −.06    .26   −.25   −.29
2. Intelligence                       −.08    .41   −.36   −.27    .48    .34
3. Realistic orientation                     −.24   −.37   −.17   −.46   −.57
4. Intellectual orientation                         −.40   −.17    .38    .16
5. Social orientation                                      −.23   −.32   −.01
6. Conventional orientation                                       −.12   −.30
7. Enterprising orientation                                               .45
8. Artistic orientation

Note. r.05 = .11; r.01 = .14.


DISCUSSION


The results imply that this new method (EAT) for assessing college
environments possesses moderate validity and substantial reliability.
Moreover, EAT, being limited to eight variables which are inexpensive to
obtain,⁴ lessens the computational problems which arise in the study of
college environments and brings such studies within the financial means
of more researchers.
One limitation of the present study is that it used the major fields of
the students, rather than their vocational choices. Since some of the
major fields reported in Earned Degrees are ambiguous with regard to the
actual occupations implied, it was necessary to make difficult and
perhaps arbitrary decisions in assigning them to a particular class. As
was noted previously, this is particularly true of those “business”
majors who plan careers in sales or advertising, and should be
classified as Enterprising instead of Conventional. Some of these
problems would be solved if we knew the student’s vocational choice.
In addition, the present scheme does not give direct information about
the faculty. It is likely, however, that in such matters as level of
ability, number, and “primary occupations” (fields of specialization),
the faculty of an institution resembles its student body. This
assumption is supported by the fact that many of the CCI items are
actually descriptions of specific faculty behaviors. In addition, Pace
and Stern (1958) found that faculty and student reports of the college
press were similar.

⁴The intelligence level of the student body might seem to be expensive
to obtain; however, most institutions routinely test incoming freshman
classes with standardized tests. Even in the absence of such tests, the
average high school rank of the students would probably be a useful
estimate of the intelligence level in the student body.

EAT appears to be especially well- 
suited to the study of college-student 
interactions. A major problem in such 
studies is how to assess the similarity 
between persons and environments in 
a meaningful way. Since the EAT is 
based principally on a heuristic theory 
of vocational choice and occupational 
types, it permits us to classify both 
students and colleges according to the 
same six types. More importantly, the 
same operations (vocational choice or 
major field) are used for typing both 
the student and his environment. 

The present results also indicate that the attributes of the student
body reflect a major portion of what has been called the college press
or environment,⁵ since the press characteristics (as measured by the
CCI) of a particular school can be predicted with modest success simply
from a knowledge of the students’ number, average intelligence, and
distribution of major fields. This conclusion is supported by a recent
study of college productivity (Astin, 1961), which showed that, even for
small samples of students entering college, the proportion planning to
major in natural science was highly related to several student and
faculty press variables. It is obvious that in future studies of college
“influence,” the students’ characteristics must be taken into account
before we attribute differences in performance to the college press.

⁵It should be pointed out that such a conclusion is not necessarily at
variance with studies such as those of McFee (1961), who found no
evidence at one college of a systematic relationship between the needs
of individual students and their perceptions of the college press. The
question of the environment’s being related to the nature of the
students concerns aggregates of student characteristics across
institutions [...]

One of the most compelling aspects 
of EAT is the possibility it offers for 
measuring other kinds of environ- 
ments. If, as the present data suggest, 
a simple occupational census of the 
members of any group gives a valid 
estimate of the environment or climate 
of that group, it should be possible to
conduct a wide variety of cultural and 
sociological studies using an elemen- 
tary method. Studies of other institu- 
tions and cross-cultural studies, in 
particular, would seem especially 
suited to this method. 


SUMMARY


A method for measuring the college 
environment, the Environmental As- 
sessment Technique (EAT), was de- 
scribed. Measures of eight character- 
istics of the student body—its size, 
average intelligence, and six “personal 
orientations” —were validated against 
the College Characteristics Index at 
36 colleges. The six personal orienta- 
tions, which were obtained from a cen- 
sus of students’ major fields at each 
institution, were also shown to be 
highly stable over time. Some of the advantages, limitations, and
possible applications of the EAT were discussed.




REFERENCES 

Astin, A. W. A re-examination of college productivity. J. educ.
    Psychol., 1961, 52, 173-178.

Barker, R. G., & Wright, H. F. Midwest and its children. Evanston,
    Ill.: Row, Peterson, 1954.

Dailey, C. A. Natural history and phenomenology. J. indiv. Psychol.,
    1960, 16, 36-44.

Fishman, J. A. 1957 supplement to College Board scores no. 2. New York:
    College Entrance Examination Board, 1957.

Goldsen, Rose K., Rosenberg, M., Williams, R. M., Jr., & Suchman, E. A.
    What college students think. Princeton: Van Nostrand, 1960.

Holland, J. L. Preliminary manual for the Holland Vocational Preference
    Inventory. (Res. ed.) Palo Alto, Calif.: Consulting Psychologists
    Press, 1959. (a)

Holland, J. L. A theory of vocational choice. J. counsel. Psychol.,
    1959, 6, 35-45. (b)

Holland, J. L. The relation of the Vocational Preference Inventory to
    the Sixteen Personality Factor Questionnaire. J. appl. Psychol.,
    1960, 44, 291-296.

Holland, J. L. Some explorations of a theory of vocational choice.
    Evanston, Ill.: National Merit Scholarship Corporation, 1961.
    (Mimeo)

Irwin, Mary (Ed.) American universities and colleges. (8th ed.)
    Washington, D. C.: American Council on Education, 1960.

Lazarsfeld, P. F. Progress report on a project to map out the general
    area of non-intellectual factors in the prediction of college
    success. New York: College Entrance Examination Board, 1959.
    (Mimeo)

Linton, R. The cultural background of personality. New York: Century,
    1945.

McFee, Anne. The relation of students’ needs to their perception of the
    college environment. J. educ. Psychol., 1961, 52, 25-29.

Pace, C. R., & Stern, G. G. An approach to the measurement of
    psychological characteristics of college environments. J. educ.
    Psychol., 1958, 49, 269-277.

Thistlethwaite, D. L. College press and student achievement. J. educ.
    Psychol., 1959, 50, 183-191.

United States Office of Education. Earned degrees conferred by higher
    educational institutions: 1957-1958. (Circular No. 570) Washington,
    D. C.: United States Department of Health, Education, and Welfare,
    1959.

(Received June 4, 1961)


Journal of Educational Psychology 
1961, Vol. 52, No. 6, 317-321 


THE DESIRE FOR SECURITY:
AN ELEMENT IN THE VOCATIONAL CHOICE OF
COLLEGE MEN¹
STUART H. BLUM²
Teachers College, Columbia University


Considerable investigation has been carried out by workers of many
disciplines in attempting to determine the manner in which various
factors are related to vocational choice. Factors believed to be
operating in this choice are numerous and include such concepts as
overall mental capacity, specific [...], interests, and temperament.
One of the often mentioned, but less investigated of these factors is
the desire for security.

Interviewing and counseling experience with military personnel and
civilian college students on the part of the investigator has yielded
the impression that a desire for security is one of the most conspicuous
factors involved in the selection of an occupation or in the choice of a
specific job. Expressed emphasis by interviewees on such factors as
fringe benefits, tenure, guarantee of employment, and on the use of the
term security itself, often far outweighed concern about the content or
nature of the actual work. Increasing trends toward a so-called Welfare
State and both contemporary nonfiction and reports of scientific studies
suggest an increased interest on the part of the general population in
obtaining security.

¹The article is based on a doctoral dissertation submitted to Teachers
College, Columbia University. The author wishes to express his
appreciation for the guidance and encouragement received from the
members of his Dissertation Committee, Robert L. Thorndike (Chairman),
Elizabeth P. Hagen, and the late Irving D. Lorge.

²[...] Testing, and Counseling Center, Hofstra College, Hempstead, New
York.


The purpose of the study on which 
this report is based was to ascertain 
differences among individuals with re- 
spect to the degree of importance 
placed upon a desire for security as a 
factor in vocational choice, and to de- 
termine how this degree of emphasis 
is related to other pertinent characteristics
or variables of the individual.
Whereas any measure of security or 
insecurity is probably a reflection of 
a deeper psychological security, for 
purposes of the present investigation 
the deeper psychological aspects were 
largely treated as measurable normal 
personality variables. 

The areas for investigation were 
based on the expectation that: 

1. Degree of emphasis placed upon 
security as a factor in vocational 
choice exists as a trait of some com- 
prehensiveness that can be measured 
reliably. 

2. A positive relationship exists be- 
tween emphasis upon security in vo- 
cational choice and the actual choice 
of a job situation of a secure type. 

3. Significant relationships exist be- 
tween emphasis upon security in voca- 
tional choice and normal personality 
variables as measured by the Edwards 
Personal Preference Schedule (1957). 

4. Significant relationships exist between emphasis upon security in
vocational choice and background factors.

Since an instrument to measure the degree of emphasis placed upon
security in vocational choice was not available that met desired
specifications, a 40-item inventory was developed.


The Security Inventory³


The final form of the Security Inventory contained 40 forced-choice
items of the triad type. Of the three alternatives contained in each
item, one was scored as contributing to the Security variable, which was
thus expressed as a simple raw score with a possible range of 0-40.

The content of the items was based on the investigator’s interviewing
and counseling experience, and utilized concepts expressed in Caplow
(1954), Daugherty (1941), Hayek (1944), Hoppock (1935), Lindahl (1949),
Lorge and Blau (1942), McGregor (1944), Nolan (1956), Riesman, Denny,
and Glazer (1953), Roe (1956), Shartle (1952), Thorndike and Hagen
(1957), Viteles (1953), and Whyte (1956). The basic conceptualization of
a desire for security as expressed in the items included, when compared
to other alternatives, a preference for: membership in a group; fringe
and retirement benefits; freedom from the ups and downs of business or
political conditions (including the concept of a guaranteed annual
wage); dependence on rules, regulations, standards, procedures, and
decisions of superiors (freedom from responsibility on the job);
physical safety; allowance for personal shortcomings of the individual;
existence of similar jobs in other organizations or under conditions of
national emergency; lack of worry, risk, and competition; assurance of
the maintenance of a certain standard of living; personal comfort; a
fixed overall work schedule; compensation, promotion, and retention
based on amount and length of time worked; working for an organization
(rather than being own boss); the proven and the established; the
familiar; routine and certainty.

The booklet of the Security Inventory was entitled Attitudes in
Vocational Choice, and the subjects were not informed that desire for
security was being measured. All items were to be completed in spite of
any difficulty of making a choice.

³A copy of the complete Security Inventory as well as a detailed
description of its construction may be found in the dissertation,
Security in Vocational Choice: A Study of Male College Upperclassmen,
available from University Microfilms, Incorporated; 313 North First
Street; Ann Arbor, Michigan: Order No. Mic 60-5083, remitting $3.35 for
microfilm, or $4.80 for xerographic enlargement.


For inclusion in the Security Inven- 
tory, the selection of the final items 
from an original pool of 80 was pri- 
marily based on agreement by judges 
as to the Security response in each 
item and upon a subsequent item anal- 
ysis with a preliminary group of 1 - 
male college students. 


PROCEDURE


In addition to the Security Inventory and the Edwards Personal
Preference Schedule, a 24-item biographical information questionnaire,
which included items designed to obtain some indication of the actual
job situation desired by each individual as well as his characteristics
and experiences considered pertinent to the study, was prepared.

The complete set of instruments was administered to 513 students
enrolled in the following institutions: City College (New York),
Fairleigh Dickinson University, Colgate University, Buffalo University,
and Columbia University. Of this number 404 students met the
qualifications of bona fide male upperclassmen (juniors and seniors aged
30 or less). In addition to practical considerations such as
availability of subjects, ease of administration of instruments, and
motivation for cooperation, the male college upperclassmen group
provides information and suggests implications for both education and
industry as well as for psychology. Further, this group represents the
potential leadership in many areas of our society at a period in time
when the group’s members are focusing on their vocational choices.
Because the study was concerned not with absolute distribution, but
rather with relationships between results on various measures, the
primary concern in the selection of subjects was to secure a
heterogeneous sample of upperclassmen in order that adequate range of
response would be obtained.

Inasmuch as the Security Inventory yielded the variable against which
all the other variables examined were to be compared, the limited class
time was used for the administration of this instrument. The subjects
completed the Edwards Personal Preference Schedule and the biographical
information questionnaire out of class on their own. A system of
individual code numbers provided anonymity, and as is consistent with
obtaining some degree of motivation and with fairness and ethical
considerations, there was some feedback of results.

Of the 404 subjects who met the qualifications for inclusion in the
sample, 86%, 346 students, completed all the materials.

Scores (the total number of “security responses” chosen) on the Security
Inventory and raw scores on the Edwards Personal Preference Schedule
were correlated. Where possible, the items of the biographical
information questionnaire were coded so as to make possible
product-moment correlations with the raw scores of the Security
Inventory. Analysis of variance procedures were utilized where
correlational analysis was not feasible.


RESULTS

The successful development of the Security Inventory was obviously
crucial to further pursuit of the investigation. On the basis of 404
male juniors and seniors, comprising the sample of the main study before
eliminations due to deficiencies on other instruments, a mean of 13.6
and a standard deviation of 6.0 was obtained for the final form of the
inventory. The scores had a basically unbroken and symmetrical
distribution. The reliability as estimated by Kuder-Richardson Formula
Number 20 was found to be .82.
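
For readers unfamiliar with Kuder-Richardson Formula 20, the sketch below shows the standard computation for items scored 0 or 1 (here, 1 would mean the “security” alternative was chosen). The function name and the tiny response matrix are invented for illustration, and the use of the population variance of total scores is an assumption about convention; the article does not say which variance was used.

```python
def kr20(responses):
    """Kuder-Richardson Formula 20 for a list of per-person 0/1 item scores.

    KR-20 = k/(k-1) * (1 - sum(p_i * q_i) / variance of total scores),
    where p_i is the proportion scoring 1 on item i and q_i = 1 - p_i.
    """
    n_people = len(responses)
    k = len(responses[0])
    if n_people < 2 or k < 2:
        raise ValueError("need at least 2 people and 2 items")
    totals = [sum(person) for person in responses]
    mean_total = sum(totals) / n_people
    # Population variance of total scores (an assumed convention).
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_people
    if var_total == 0:
        raise ValueError("KR-20 is undefined when all total scores are equal")
    sum_pq = 0.0
    for item in range(k):
        p = sum(person[item] for person in responses) / n_people
        sum_pq += p * (1 - p)
    return (k / (k - 1)) * (1 - sum_pq / var_total)


# Toy data: 4 hypothetical respondents, 3 hypothetical items.
toy_responses = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
print(f"KR-20 = {kr20(toy_responses):.2f}")  # 0.75 for this toy matrix
```

The same function applied to a 404-by-40 matrix of scored responses would give the kind of coefficient reported above.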


Table 1 shows the results of the analysis of variance used to test the
significance of the differences among the means of the security scores
for each of the categories of anticipated job situation. [...]
progressively higher security score means are [...] progressively [...]
and generally [...]. The size of the F ratio obtained indicates
differences significant at the .01 level. It should be noted that the
mean security score of those who anticipate working for themselves is
approximately a full standard deviation below the mean security score of
those who plan to work in civil service. The above findings also serve
to validate further the Security Inventory itself, for the scores on the
inventory are in the direction of the actual job situation choices.


TABLE 1

SECURITY SCORE AND INDICATION OF
ANTICIPATED JOB SITUATION
(N = 346)

Anticipated job situation         M       SD      N
Work for self                   11.27    5.19     94
Work in civil service           17.94    5.80     35
Work for a large company        14.44    5.97    176
Work for a small company        13.88    5.27     41

F = 13.07, p < .01
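
The F ratio in Table 1 can be approximated from the table’s own means, standard deviations, and group sizes. The sketch below is a standard one-way analysis of variance computed from summary statistics, not the author’s worksheet; it assumes the tabled SDs use an n − 1 denominator, so the result should land close to, but not exactly on, the reported 13.07 because of that assumption and of rounding in the tabled values.

```python
def f_from_summary(groups):
    """One-way ANOVA F ratio from (mean, sd, n) triples.

    Assumes each sd is a sample standard deviation (n - 1 denominator).
    Returns (F, df_between, df_within).
    """
    total_n = sum(n for _, _, n in groups)
    grand_mean = sum(mean * n for mean, _, n in groups) / total_n
    ss_between = sum(n * (mean - grand_mean) ** 2 for mean, _, n in groups)
    ss_within = sum((n - 1) * sd ** 2 for _, sd, n in groups)
    df_between = len(groups) - 1
    df_within = total_n - len(groups)
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# (mean, SD, N) for the four anticipated job situations in Table 1.
table_1 = [
    (11.27, 5.19, 94),   # work for self
    (17.94, 5.80, 35),   # work in civil service
    (14.44, 5.97, 176),  # work for a large company
    (13.88, 5.27, 41),   # work for a small company
]

f_ratio, df_b, df_w = f_from_summary(table_1)
print(f"F({df_b}, {df_w}) = {f_ratio:.2f}")

# Gap between the civil service and self-employment means, expressed in
# units of the overall standard deviation of 6.0 reported in the text.
gap_in_sd = (17.94 - 11.27) / 6.0
print(f"gap = {gap_in_sd:.2f} SD")
```

The second figure is the basis for the remark that the two extreme groups lie roughly a full standard deviation apart.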

Table 2 presents the correlations of 
the security score with each of the 
personality variables of the EPPS. 
Eight of the 15 coefficients and the 
multiple correlation coefficient of se- 
curity score with the 15 EPPS vari- 
ables are significant at the .01 level. 
One additional coefficient is significant 
at the .05 level only. Each relation- 
ship is in a direction consistent with 
the basic conceptualization of a desire 
for security. The personality charac- 
teristics most closely associated with 
emphasis on security in choosing a job 
or an occupation, are a desire for Or- 
der and an avoidance of Change. 

TABLE 2

MEANS AND STANDARD DEVIATIONS OF THE EDWARDS PERSONALITY VARIABLES
AND CORRELATION COEFFICIENTS, REGRESSION WEIGHTS, AND THE MULTIPLE
CORRELATION COEFFICIENT OF THE SECURITY SCORE WITH THESE VARIABLES
(N = 346)

                                                        Regression
Variable              M       SD      Coefficient       weight
Security            13.87    5.99
Achievement         16.92    4.23       −.22**            −.13
Deference           10.72    3.28    [illegible]**         .07
Order               11.05    4.44        .32**             .22
Exhibition          14.69    3.53       −.05               .06
Autonomy            14.92    4.28       −.27**          [illegible]
Affiliation         13.50    3.87        .09               .08
Intraception        15.86    4.69       −.06              −.02
Succorance           9.76    4.37        .21**             .11
Dominance           17.36    4.72       −.22**            −.10
Abasement           11.59    4.74        .23**             .12
Nurturance          12.82    4.58        .12*           [illegible]
Change              15.52    4.57    [illegible]**        −.21
Endurance           13.90    5.12        .10               .05
Heterosexuality     17.42    5.23        .02               .10
Aggression          13.97    4.43       −.08               .02
Consistency         11.45    2.09       −.05
Multiple                                 .54**

Note. Coefficients are all product-moment.
* Significant at .05 level.
** Significant at .01 level.


Of the biographical variables examined by correlational analysis, two
yielded coefficients with security score significant at the .01 level.
High desire for security in choosing a job or an occupation shows a
slight tendency to be associated with lower status of father’s
occupation (r = .18) and with frequent religious service attendance
(r = .18). Three of the variables yielded coefficients with security
score significant at the .05 level only. Emphasis upon security in
choosing a job or an occupation shows a slight tendency to be associated
with poor health (r = .12), little athletic participation in college
(r = .11), and a shorter response to the question concerning
occupational goals (r = .11).

Of the biographical variables examined by analysis of variance, only the
means of the security scores for each of the fields of major study in
college gave an F ratio significant at the .05 level.


TABLE 3

SECURITY SCORE AND FIELD OF
MAJOR STUDY IN COLLEGE
(N = 346)

Field of major study              M        SD         N
Business                        14.94     5.90       118
Engineering                     13.84     4.87        49
Humanities                      13.00     7.16        21
Pre-law                         11.13     7.95        16
Pre-medical and pre-dental      12.12     4.93        33
Science                         12.75     5.34        50
Social Studies                  14.36  [illegible]    43
Teaching                        15.81     5.57        16

F = 2.06, p < .05

The hypothesized expectations were 
thus confirmed, and for college upper- 
classmen at least, it can be concluded 
that emphasis upon a desire for secu- 
rity in choosing a job or an occupation 
exists as a trait of some comprehen- 
siveness that can be measured reliably, 
is positively related to the actual 
choice of a job situation of a secure 
type, and is characterized by low, but 
statistically significant personality 
and background correlates. 


Discussion 

It is likely that what might be classed as chance or accident plays an
important role in an individual’s selection of a job or an occupation.
Nonetheless, previous investigations have pointed to the relative
importance of the desire for security as a factor in this decision. The
investigation on which this report is based has demonstrated that
although statistically significant relationships exist between the
variable and both normal personality variables as measured by the EPPS
and background factors, these relationships are small. For practical
purposes, therefore, it would often seem desirable to include a measure
of an individual’s desire for security in choosing a job or an
occupation as a part of vocational guidance and personnel selection
procedure. The results of a more comprehensive and clinically derived
measure of basic psychological security-insecurity might well possess a
high positive relationship with the score obtained on the Security
Inventory. However, from the viewpoint of administrative efficiency, as
well as from that of validity for the vocational guidance and/or
personnel selection situation, the use of the more specific device
appears indicated.

The differences among security score 
means for each of the fields of major 
study in college provide the basis for 
an additional measure in guidance 
practice. For example, a high security 
score would be deviant for a pre-law 
student, and might suggest further in- 
terview, since the group comprising 
this academic major possesses the low- 
est mean score. 

It may be argued that the measure 
of desire for security is contaminated 
by the educational or occupational 
choice the person has already made. 
When an individual has chosen a field of major study in college that
prepares for a specific profession or vocation, he has to some extent
made an occupational commitment. He may have chosen the field for
reasons other than a desire for security. Yet the choice may have
influenced him to select alternatives that were consistent with this
choice when filling out the Security Inventory. The individual who makes
an occupational commitment, presumably on the basis of other than
security reasons or motives, nonetheless has probably in some manner
taken security into account. It has not been determined which of these
two elements of motivation takes precedence.
SUMMARY

One of the often emphasized, but [...] factors related to vocational
choice is the desire for security. A 40-item inventory designed
specifically to measure the degree of emphasis placed upon security in
vocational choice by male college upperclassmen was developed. Results
on this Security Inventory, the Edwards Personal Preference Schedule,
and a questionnaire to ascertain background information were obtained
from a heterogeneous group of 346 upperclassmen. It was found that
emphasis upon a desire for security in choosing a job or an occupation
exists as a trait of some comprehensiveness that can be measured
reliably, is positively related to the actual choice of a job situation
of a secure type, and is characterized by low, but statistically
significant, personality and background correlates.

REFERENCES 


Caplow, T. The sociology of work. Minneapolis: Univer. Minnesota Press,
    1954.

Daugherty, C. R. Labor problems in American industry. (5th ed.) New
    York: Houghton-Mifflin, 1941.

Edwards, A. L. Manual of the Edwards Personal Preference Schedule. (Rev.
    ed.) New York: Psychological Corporation, 1957.

Hayek, F. A. The road to serfdom. Chicago: Univer. Chicago Press, 1944.

Hoppock, R. Job satisfaction. New York: Harper, 1935.

Lindahl, L. G. What makes a good job? Personnel, 1949, 25, 263-266.

Lorge, I., & Blau, R. D. Do the better qualified compete for public
    jobs? Publ. personnel Quart., 1942, 3, 29-36.

McGregor, D. Conditions of effective leadership in the industrial
    organization. J. consult. Psychol., 1944, 8, 55-63.

Nolan, C. Y. Attitude differences among disparate Air Force specialties.
    USAF Personnel Train. Cent. res. Rep., 1956, No. TN-56-88.

Riesman, D., Denny, R., & Glazer, N. The lonely crowd: A study of the
    changing American character. (Abr. ed.) Garden City, N. Y.:
    Doubleday, 1953.

Roe, Anne. The psychology of occupations. New York: Wiley, 1956.

Shartle, C. L. Occupational information: Its development and
    application. (2nd ed.) New York: Prentice-Hall, 1952.

Thorndike, R. L., & Hagen, E. P. Attitudes, educational programs, and
    job experiences of airmen who did not reenlist. USAF Personnel
    Train. Cent. res. Rep., 1957, [...].

Viteles, M. S. Motivation and morale in industry. New York: Norton,
    1953.

Whyte, W. H. The organization man. New York: Simon & Schuster, 1956.


(Received June 21, 1961)


Journal of Educational Psychology 
1961, Vol. 52, No. 6, 322-329 


THE EFFECTS OF MANNER OF SPEECH ON 
APPRECIATION OF SPOKEN 
LITERATURE 


GEORGE M. GLASGOW¹
Fordham University 


Esthetic appreciation is recognized 
as an important objective of education 
and condition of artistic development. 
In the study and teaching of litera- 
ture, there is a need for identification 
and measurement of elements of com- 
munication which promote the ap- 
preciation of literature. Hence, the 
present investigation has been de- 
signed to measure the effects of certain 
speech characteristics on audience ap- 
preciation of descriptive prose and 
lyric poetry. The problem is to deter- 
mine whether or not variations in the 
manner of speaking the literature in- 
crease or decrease audience apprecia- 
tion. 

Theoretically, the problem has as- 
sumed psychological relevance with 
the appearance of a thoughtful survey 
of the literature on esthetics by Pratt 
(1961). He refers to the concept of
“tertiary qualities” (p. 80), “the ex-
pression of art as a property closely
bound with the perceptual structure”
of the person who responds, as an as-
pect of esthetic theory to be explored
further. In this inquiry, the expressive
qualities of elements of speech have 
been varied in terms of postulated 
“good” or desirable and “poor” or un- 
desirable modes of presentation. The 
operations for exploring the esthetic 
experiences of human observers are 
the elicitation of discriminatory reac- 
tions upon hearing the varied presen- 
tations. 


¹The author wishes to express gratitude
for the review of this manuscript by Doro- 
thea McCarthy. 


PROBLEM 


The expressive elements of speech 
tested were voice quality, pitch, rate, 
and enunciation. Within each of these 
elements the effects of a good manner 
were contrasted with those of a poor 
manner in terms of their influence on
audience appreciation. Each speech
element was treated as the only vari-
able in one experimental unit. There
were thus four experimental units. In 
the first unit, the effects of a good 
voice quality were compared with the 
effects of a nasal twang quality. In 
the second unit, varying pitch was 
compared with a monopitch. The third 
unit treated moderate rate versus fast 
rate; and the fourth unit, distinct 
enunciation versus indistinct enuncia-
tion.

A difference exists between the di-
rect color stimuli of a painting, or the
audible tones of music, and imagined
color or audible stimuli which result
from hearing spoken-word image sym-
bols. There is but one hue or shade of
red at any point of a picture. But the
theoretical number of specific hues or
shades of red which may be imagined
in response to the spoken word “red”
may be many and may be limited only
by the spectra of one’s imagination
and the cumulative layers of experi-
ence. The present investigation ex-
plores the relationship between the
sound spectra of voice and speech and
associated variations in audiences’ vis-
ual and auditory images, moods, ideas,
and literary values.

Appreciation was considered as a
mental and emotional response to an
object, situation, or condition in which
percepts, concepts, images, and evalu-
ations are personalized in the mean-
ings and feelings of the appreciator.
Appreciation has been viewed as self-
expression. “The lyric poem is appreci-
ated by us as an expression of our in-
ner life” (Parker, 1920, p. 40). As
self-expression, appreciation would
seem to overlap “creativity,” ex-
plained by Rhodes (1961, p. 305) as a
“phenomenon by which a person com-
municates a new concept (which is the
product).”

The process of appreciation may be 
treated analytically and synthetically. 
Parker (1920, pp. 188-193) has re- 
ferred to the elements of literary ap-
preciation, e.g., images, sound sensa-
tions, meanings, feelings, the rhythm 
of ideas; but these find embodiment 
and unity in the meaning of the whole 
esthetic experience. 

The measurement of appreciation 
and esthetic experience has been stud-
ied by numerous investigators. Speer
(1929) conducted research on the
“Measurement of Appreciation in
Poetry, Prose, and Art and Studies in
Appreciation.” Langfeld (1920) and
Birkhoff (1933) evaluated the “aes-
thetic attitude” and esthetic measures.
Recently Thurstone (1959) described
a method of measurement which seems
to be basically comparable to one of
the methods used in the present study,
as follows:

[Quotation largely illegible in the source; legible
fragments: “scale of preference ... be that point at
... as to whether he ... dislikes ... are very
satisfactory” (p. 181).]


In this investigation, good quality 
was a pleasant, male baritone quality. 
The nasal twang was produced by a 
constriction of the nasopharynx. Good 
pitch was an appropriately varying 
intonation. The monopitch was a sus- 
tained pitch of a piano “middle C.” 
The good rate was 130 words per min- 
ute; the fast rate was 250 words per 
minute. Good enunciation was distinct 
but not “artificial” or overprecise; 
poor enunciation was moderately in- 
distinct and portions of it were slightly 
slurred. 


METHOD


The investigation consisted of two one- 
group experiments conducted with two sim- 
ilar but different groups. The second experi- 
ment was essentially a replication of the 
first, with the exception of changes in the 
order of presenting the good-poor sequences 
and their specimens. The purpose of repeat- 
ing the experiment with a second group was 
to secure additional evidence and to provide 
opportunities for changes in the order of 
rotation. 

In each experiment, the “good” level of a 
speech element, e.g., quality, was applied to 
one member of a pair of literary specimens 
and presented to an audience and their ap- 
preciation of the spoken specimen was meas- 
ured. Then the “poor” level of the same 
speech element was applied to the paired 
mate of the first specimen and presented to 
the same audience and their appreciation of 
the second specimen was likewise measured. 
If the good level of a variable was presented 
first, the poor level immediately followed, 
and if the poor level was presented first, the 
good level immediately followed. 

When the good level of any variable was 
presented in a “set,” all other variables and 
all other recognizable speech characteristics 
in the same good presentation were kept 
good as far as was possible. When the poor 
level of the same variable was presented, all 
speech factors remained good as far as pos- 
sible but the single, poor speech level. Thus 
any differences between the audience’s ap- 
preciation of the first and the second speci- 
mens spoken in the good and the poor 
speech levels were attributed to the different 
effects of these levels on appreciation. 

Literary Specimens. It was recognized
that the two literary specimens used to ex- 
press the two different levels of any variable 
should be alike in order that the same po- 
tentials of appreciation would be available 
on which these good and poor levels could 
exert their possibly varying expressive in- 
fluences. Exact duplicates, however, would 
have entailed the use of the same literary 
specimen for any good-poor set, since every 
literary specimen is unique, and this would 
have involved the repetition of “content” 
for every second presentation in a good-poor 
set. Consequently, the use of a pair of simi- 
lar matched specimens was considered to be 
preferable, conditioned with certain safe- 
guards to enhance the value of the results. 
First, two pairs of prose and poetry speci- 
mens were used for each speech element in- 
stead of the minimum of one pair. Second, a 
rotation was employed for the order of pres- 
entation of the good and poor levels of each 
element, for the speech elements themselves, 
and for the particular specimen in a pair 
used to express the good or the poor level 
of a speech element. It may be noted that 
“The rotation method ...is frequently used 
when ...there is doubt concerning the equiv- 
alence of the groups” (Good, Barr, & Scates, 
1938, p. 495). In this experiment, there was 
doubt concerning the equivalence of the 
two specimens of each literary pair. Hence 
the three factors mentioned were rotated. 
Because of the nature of the experiment, it 
was not necessary to equate the two groups, 
subject against subject. Third, although the 
experiment with the first group was com- 
plete, the basic experiment was repeated, as 
indicated, in order to secure additional evi- 
dence and to provide opportunity for further 
rotation of the speech levels and specimens.

Pairing of Specimens. The matching of
the literary specimens in each pair² was done
by 10 experienced high school and college
English teachers on the following bases:
suitability of content for the grade level,
similarity of literary style, potentials for ap-
preciation, length, literary value, and rela-
tive unfamiliarity to the subjects. The pairs
were designated: A-B, C-D for quality; E-F,
G-H for pitch; I-J, K-L for rate; and M-N,
O-P for enunciation.

Subjects. The two groups of subjects con-
sisted of 90 and 136 fourth [illegible]
sophomore girls. The groups approximated


² Typical of a “pair” of poetry specimens
were “Love Song from New England” by 
Winifred Wells and “I Shall Not be Afraid” 
by Aline Kilmer. Typical of the prose speci- 
mens were selections from Anne Lindbergh’s 
North to the Orient. Prose specimens were 
limited to 260 words each. 




each other to the following extent: All mem- 
bers of both groups were of the same grade 
level and sex. Only those with normal hear- 
ing, as determined by two audiometer tests, 
were included in either group. The mean 
IQ for Group I was 105.5 and for Group II, 
105.1, with standard deviations of 11.9 and 
10.3, respectively. There were no significant 
differences in the distributions of the IQ 
scores. English grades were also considered 
as possible indexes of the students’ potential 
to appreciate English literature. Averages of 
four English grades yielded a mean of 78% 
with a range of 67-91% for Group I and 76% 
with a range of 63-91% for Group II. In 
view of the mean IQ and mean English 
marks, it was believed the two groups were 
similar samples of female high school stu- 
dents for purposes of this experiment. 

Judging of Specimens. Three experienced 
university speech teachers agreed unani- 
mously as to the goodness and poorness of 
the opposite levels of each variable. They 
also agreed as to the basic goodness and 
sameness of the other recognizable speech 
aspects in both presentations of each “set” 
for each variable. The good characteristics 
were judged to be esthetically pleasing, to
express suitable emotional states, to sustain
attention, and to provide appropriate in-
tonation, emphasis, etc. The good manners
were believed to be better than the average
comparable characteristics of college stu-
dents; the poor manners moderately worse.

Recordings. High-fidelity recordings were 
used in order to keep the presentations con-
stant, to insure that the speech examples
which were approved were the ones used
during the procedure, and to isolate speech
as the experimental factor apart from ges-
tures and facial expression. All the speech
examples were recorded by the writer.

Rotational Sequence. The pairs of literary
specimens were presented in good-poor sets.
A set was a pair of specimens with one mem-
ber spoken in the good level and the other
in the poor level of the same speech element.
The sets were presented to Group I in two
series as follows: quality, A good-B poor;
pitch, G poor-H good; rate, I good-J poor;
enunciation, O poor-P good. In the second
series the second set of specimens for each
variable was presented in the opposite set
sequence, namely, quality, C poor-D good;
pitch, E good-F poor; rate, K poor-L good;
enunciation, M good-N poor. The sequence
was the same for prose.

In the procedure with Group II, the set
sequence was the reverse of that used for
Group I. Specimen A was presented with
poor quality and B with good quality; G
with good pitch modulation and H with
monopitch. The remaining variables fol-
lowed a similar order.
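
The rotation described above can be written out as a small schedule. The sketch below is illustrative only: the tuple layout and the function name are hypothetical, the second pitch pair is taken to be E-F as in the list of pairs, and Group II is assumed to keep the specimen order of Group I while exchanging the good and poor levels, which is what the examples for specimens A, B, G, and H suggest.

```python
# Group I schedule as listed in the text. Each entry is
# (speech element, first specimen, its level, second specimen, its level).
# Assumption: the second pitch pair is E-F, as in the list of pairs.
GROUP_I = [
    ("quality", "A", "good", "B", "poor"),
    ("pitch", "G", "poor", "H", "good"),
    ("rate", "I", "good", "J", "poor"),
    ("enunciation", "O", "poor", "P", "good"),
    ("quality", "C", "poor", "D", "good"),
    ("pitch", "E", "good", "F", "poor"),
    ("rate", "K", "poor", "L", "good"),
    ("enunciation", "M", "good", "N", "poor"),
]


def reverse_levels(schedule):
    """Exchange good and poor within every set, keeping specimen order."""
    swap = {"good": "poor", "poor": "good"}
    return [(el, a, swap[la], b, swap[lb]) for el, a, la, b, lb in schedule]


def levels_by_specimen(schedule):
    """Map each specimen letter to the speech level in which it was heard."""
    levels = {}
    for _, a, la, b, lb in schedule:
        levels[a] = la
        levels[b] = lb
    return levels


# Assumed Group II schedule: same specimens, opposite levels.
GROUP_II = reverse_levels(GROUP_I)
```

Under these assumptions every specimen is heard in the good manner by one group and in the poor manner by the other, which is the purpose of the rotation.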

Measurement of Appreciation. The ex- 
periment required an appreciation test which 
would show changes of expressed apprecia- 
tion in accordance with differences in the 
good and poor presentations of the variables. 
Since such tests were not available, it was 
necessary to construct original tests for this 
study.

The tests were basically check sheets hav-
ing steps representing different degrees of
response. Two techniques of measuring ap-
preciation were included in each test given.
The first technique secured as indexes of
appreciation the listeners’ estimates of one
of three degrees of the presence of each of
five specified appreciation elements in each
selection heard plus the extent to which the
listeners liked each element. The three de-
grees of presence from which one was to be
checked were strong, medium, and weak.
The three steps of liking for the elements
were strong, indifferent (neutral), and dis-
liking.

The appreciation elements listed for
prose were recognition of topic, clarity of
thought, unity, vitality, and sense imagery.
Those for poetry were recognition of theme,
mood, force of emotion, sense imagery, and
rhythm.

The second technique of testing secured 
the listeners’ estimates of the literary merit 
of each whole specimen heard plus their 
personal preference for each whole specimen. 
Five levels of literary merit were mentioned: 
very superior (a masterpiece), superior, 
average textbook material, standards of a 
less important magazine, and below news-
paper standards. In treating the data later,
the first two levels were combined into one
termed “high” and the last two into one
called “low.” Hence, literary merit as
treated became high, medium, and low. The
levels of preference were also high, medium,
and low.
RESULTS

Appreciation Elements

The first step in treating the data
was the combination of each score on
the “presence” of each appreciation
element with the parallel score on lik-
ing in order to form a joint index of
perception plus emotional response.
This provided five indexes for each
poetry and prose selection heard by
each subject.

Since each presence score could be 
combined with any of the three degrees 
of liking, there were nine possible com- 
binations of presence and liking for 
each of the five elements in every se- 
lection. These were: strong presence, 
Number 1, with any of the three de- 
grees of liking; medium presence, 
Number 2, with any of the three de- 
grees of liking; and weak presence, 
Number 3, also with any of the three 
degrees of liking. In such a circum- 
stance, the presence-liking ratio pro- 
vided three cells of perfect agreement, 
namely 1:1, high; 2:2, medium; and 
3:3, low appreciation. 
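
The nine presence-liking combinations and the three cells of perfect agreement can be enumerated directly. This is a minimal sketch, not the scoring procedure of the study: the names are hypothetical, the label "mixed" for the six off-diagonal cells is an assumption introduced here, and the example responses are invented.

```python
from itertools import product

# Codes follow the text: 1 = strong, 2 = medium (or indifferent), 3 = weak (or disliking).
PRESENCE = {1: "strong", 2: "medium", 3: "weak"}
LIKING = {1: "strong", 2: "indifferent", 3: "disliking"}

# Cells of perfect agreement named in the text.
AGREEMENT = {(1, 1): "high", (2, 2): "medium", (3, 3): "low"}

# All nine presence-liking cells for one appreciation element.
CELLS = list(product(PRESENCE, LIKING))


def classify(presence, liking):
    """Return the appreciation level for a presence:liking pair.

    "mixed" is a hypothetical label for the six cells outside the diagonal.
    """
    return AGREEMENT.get((presence, liking), "mixed")


def agreement_percentage(pairs):
    """Percentage of presence:liking pairs falling in the agreement cells."""
    agreeing = sum(1 for pair in pairs if pair in AGREEMENT)
    return 100.0 * agreeing / len(pairs)


# Invented responses of four listeners to one element of one selection.
example = [(1, 1), (2, 2), (1, 2), (3, 3)]
print(agreement_percentage(example))
```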

The scores for the two good speci- 
mens in each two sets for each varia- 
ble were averaged and the scores for 
each two poor specimens opposed were 
likewise averaged to form good and 
poor categories. The percentages of the 
scores in each of the nine presence-lik- 
ing cells were computed for the good 
and the poor manners of each variable 
in prose and in poetry. These percent- 
ages showed that from 75-87% of all 
scores had been placed in the cells of 
perfect agreement: the 1:1, the 2:2, 
or the 3:3. 

The data showed that the percent- 
ages of subjects expressing high ap- 
preciation of the individual elements 
were consistently greater when the 
speech manner was good than when it 
was poor. The number expressing low 
negative appreciation was consistently 
greater when the speech manner was 
poor. 

The next step was the computation 
of an appreciation index which was an 
average of all five elements for each 
specimen, good or poor, heard by each 
subject. 

These indexes for both groups of
subjects were so similar that the scores
of both groups in the same categories
were combined from here on. The dis-
tributions of average response scores
of Group I and Group II were found
to conform very closely to each other
in comparable categories. Eight com-
parisons were made, namely, four for
the presence scores and four for the
preference scores. The four compari-
sons of presence-score distribution
were composed of two prose compari-
sons, good and poor, and of two poetry
comparisons, good and poor. The same
comparisons were made for preference.
In none of the eight instances could
the “null hypothesis” of “no differ-
ence” be rejected at the 5% level of
confidence. The comparisons were
made by means of the Kolmogorov-
Smirnov two-sample test (Siegel,
1956, p. 131). No differences between
the cumulative percentage distribu-
tions of Groups I and II exceeded the
critical value of 18%. These findings
were considered sufficient for assuming
that the scores for Groups I and II
were drawn from the same population
and could therefore be combined.

Table 1 lists the percentages of the
total number of subjects who expressed
high positive (1:1), mediocre (2:2),
and low negative (3:3) appreciation
in each good and poor category of each
variable and the chi squares for the
“independence” of the good and the
poor speech versus the three levels of
appreciation.

TABLE 1
PERCENTAGES OF SUBJECTS IN THE HIGH, MEDIUM, AND LOW APPRECIATION
CELLS BY TYPE OF SPEECH PRESENTATION OF EACH VARIABLE IN
PROSE AND POETRY AND THE CHI SQUARES

Appreciation        Quality        Pitch          Rate       Enunciation
                  Good   Poor   Good   Poor   Good   Poor   Good   Poor
Prose
  High positive     40     19     44     27     55     29     47      5
  Mediocre          33     38     30     32     23     33     31     29
  Low negative       4     17      5     15      3     15      3     53
  Chi squares*        66.61          37.23          69.11         337.65
Poetry
  High positive     41     18     42     16     44     15     44      8
  Mediocre          26    [?]     29     29     31    [?]     33     32
  Low negative       4     17      5     27      3     25      2     36
  Chi squares*        78.15         114.32         185.85         231.78

* All chi squares are significant at beyond the .001 level of confidence.
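
The 18% critical value reported for the Kolmogorov-Smirnov comparison of Groups I and II can be checked with the usual large-sample approximation for the two-tailed .05 level, D = 1.36 sqrt((n1 + n2) / (n1 n2)), using the group sizes of 90 and 136 given under Subjects. The sketch below is illustrative: the function names are hypothetical and the frequency counts in the example are invented, not data from the study.

```python
from math import sqrt


def ks_critical_value(n1, n2, coefficient=1.36):
    """Large-sample critical value of D; 1.36 is the two-tailed .05 coefficient."""
    return coefficient * sqrt((n1 + n2) / (n1 * n2))


def ks_statistic(counts_a, counts_b):
    """Largest absolute difference between two cumulative proportion
    distributions, given frequencies over the same ordered categories."""
    total_a, total_b = sum(counts_a), sum(counts_b)
    cum_a = cum_b = 0.0
    largest = 0.0
    for a, b in zip(counts_a, counts_b):
        cum_a += a / total_a
        cum_b += b / total_b
        largest = max(largest, abs(cum_a - cum_b))
    return largest


# Group sizes from the text give a critical difference of about 18%.
critical = ks_critical_value(90, 136)
print(round(100 * critical, 1))

# Invented frequencies over three ordered score categories.
d = ks_statistic([30, 40, 20], [50, 56, 30])
print(d < critical)
```

A difference between cumulative distributions smaller than the critical value leaves the hypothesis of a common population standing, which is the basis on which the two groups were pooled.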
In Table 1 variations in apprecia-
tion scores accompanying variations
in speech manner are evident. The
number of subjects who expressed high
appreciation was markedly and con-
sistently greater when the speech man-
ners were good than when they were
poor. In prose, for example, the good-
poor percentage ratios of 40:19 for
high appreciation of quality through
to 47:5 for enunciation reveal two to
nine times as much high appreciation
when the speech styles were good as
when they were poor. Similar ratios
exist in poetry. Moreover, low nega-
tive appreciation of prose and poetry
was greater when the speech manners
were poor.

There is an interesting similarity
between the parallel scores for prose
and poetry. For instance, the good-
poor ratio for high appreciation under
quality is 40:19 for prose and 41:18
for poetry; enunciation, 47:5 for prose
and 44:8 for poetry. Such percentages
would seem to indicate that speech
manners affect appreciation somewhat
similarly even though the types of lit-
erature may differ.


Whole Literary Specimens 


The percentages of the subjects’ es-
timates of the literary merit of, and
preferences for, the whole specimens
are presented in Table 2. Since the
high and low levels of merit and pref-
erence tell the essential story, the mid-
dle level is omitted here. Chi squares
are listed for the independence of the
high, medium, and low levels of liter-
ary merit and preference, respectively,
versus the good and the poor styles.
Chi squares were also computed for
the good and the poor frequency de-
viations of the combined four vari-
ables from their theoretically equal
(1:1 basis) distribution on the high
appreciation and the low appreciation
levels. The chi square for high appre-
ciation was 2000.52 and for low ap-
preciation, 2862.84. As the chi squares
indicate, the differences are highly re-
liable.

Table 2 shows the reduction of the
subjects’ ratings of the literary merit
and their personal preferences for the
specimens occurring concomitantly
with the poor style of each speech var-
iable. Nasality, monopitch, fast rate,
and indistinct enunciation notably de-
creased the number of persons who
rated the literary merit high in com-
parison with the number when the
speech manners were good. The num-
ber of ratings of low literary merit and
of low preference for specimens was
consistently greater when a poor
speech level was used. This held true
for all four speech variables. Poor
enunciation was associated with
greater changes in appreciation than
any other variable.

TABLE 2
PERCENTAGES OF SUBJECTS ASSIGNING HIGH AND LOW LITERARY MERIT
TO LITERARY SPECIMENS AND PREFERENCES FOR THE SPECIMENS IN
EACH MANNER OF SPEECH PRESENTATION AND THE CHI SQUARES

                    Quality        Pitch          Rate       Enunciation
                  Good   Poor   Good   Poor   Good   Poor   Good   Poor
Prose
 Merit:
  High              20      6     19      7     32     18     25      4
  Low               31     56     36     57     20     41     29     81
  Chi squares         67.76          45.89          61.14         235.14
 Preference:
  High              30     11     30     16     48     22    [?]    [?]
  Low               10    [?]     11     27      7     28      7    [?]
  Chi squares         97.41          44.58          76.29         333.00
Poetry
 Merit:
  High              40     14     34    [?]    [?]    [?]     53     12
  Low              [?]     52     22     56     25     53     13     63
  Chi squares         97.26         126.77          86.20         249.80
 Preference:
  High              45     18    [?]    [?]     47     19    [?]    [?]
  Low              [?]    [?]    [?]    [?]    [?]    [?]    [?]    [?]
  Chi squares          [?]          186.52         143.33         315.77


Biserial Correlations 


As evidence concerning the signifi- 
cance of the deviations of the good 
versus the poor levels of each speech 
element, biserial correlations were 
computed to show the extent of the 
differentiation between the two levels 
as they operate through quality, pitch, 
rate, and enunciation of both prose 
and poetry. The biserial correlation 
coefficient has been interpreted as a 
measure of the degree of divergence of 
the effects of the good and the poor 
levels from equality.
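
The article does not state the formula by which the biserial coefficients were obtained. As a minimal sketch, assuming the standard biserial coefficient r_b = ((M_good - M_poor) / s) (pq / y), where s is the standard deviation of all scores, p and q are the proportions of scores in the two categories, and y is the ordinate of the unit normal curve at the point dividing p from q, the computation might look as follows. The function name and the example scores are hypothetical, and the population form of the standard deviation is an assumption.

```python
from statistics import NormalDist, mean, pstdev


def biserial(scores, is_good):
    """Standard biserial coefficient between a score and a good/poor split.

    Assumes both categories are non-empty and uses the population
    standard deviation of all scores (an assumption made here).
    """
    good = [s for s, g in zip(scores, is_good) if g]
    poor = [s for s, g in zip(scores, is_good) if not g]
    if not good or not poor:
        raise ValueError("both categories must contain at least one score")
    p = len(good) / len(scores)
    q = 1.0 - p
    # Ordinate of the unit normal curve at the point cutting off proportion p.
    y = NormalDist().pdf(NormalDist().inv_cdf(p))
    return (mean(good) - mean(poor)) / pstdev(scores) * (p * q / y)


# Invented composite scores: four heard in the poor manner, four in the good.
scores = [1.5, 2.0, 2.5, 2.0, 2.5, 3.0, 2.0, 3.0]
flags = [False, False, False, False, True, True, True, True]
print(round(biserial(scores, flags), 2))
```

Because the coefficient rests on an assumed normal distribution underlying the dichotomy, it can exceed 1.00 when the two categories are sharply separated, so very high values should be read with that in mind.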

In computing these coefficients, each
subject’s average of the two good
scores for the presence of the apprecia-
tion elements was combined with the
average of the two good literary merit
scores rated for each variable in order
to form a composite presence score.
The same was done with the scores in
the poor category. Similarly, each sub-
ject’s scores for preference for the ap-
preciation elements were combined
with her preferences for the whole
selections in order to form a single
preference score for the good and the
poor categories, respectively, for each
variable. These coefficients appear in
Table 3. As may be seen, the coeffi-
cients for enunciation are considerably
higher than for any other speech ele-
ment, a finding which suggests the
greater effect of enunciation upon ap-
preciation.

TABLE 3
THE BISERIAL CORRELATION COEFFICIENTS
FOR THE GOOD AND THE POOR LEVELS
FOR PRESENCE AND PREFERENCE IN
PROSE AND POETRY

                  Presence          Preference
Variable        Prose   Poetry    Prose   Poetry
Quality          .55     .70       .55     [?]
Pitch            .46     .73       .44     .78
Rate             .53     .71       .55     .76
Enunciation      .95     .94       .95     .93


SUMMARY

A “good” and a “poor” manner of
each of four speech variables (voice
quality, pitch, rate, and enunciation)
were used to present 16 paired speci-
mens of prose and 16 specimens of
poetry to 226 high school sophomores.
The speech manners were good quality
versus nasal twang, varied pitch ver-
sus monopitch, moderate rate versus
fast rate, and distinct versus indistinct
enunciation. A markedly greater num-
ber of subjects expressed high appreci-
ation of and strong preference for the
selections, and also rated the selections
as of high merit, when the speech man-
ners were good than when they were
poor.

The number of subjects who rated
the literary merit high was greater un-
der the good speech manner than un-
der the comparable poor manner of
every variable. The number who rated
the literary merit low was greater un-
der every poor speech manner.

The number of subjects expressing
strong preferences for the whole selec-
tions was increased by every good
speech manner and decreased by every
poor speech manner.

Poor enunciation decreased high ap-
preciation and increased low apprecia-
tion in terms of the comparable good
manner, more than the poor manner of
any other speech element.

All the chi squares are significant at
beyond the .001 level of confidence.

Thus, different manners of speaking
literature have different effects on au-
dience appreciation and audience judg-
ments of literary merit. Consequently,
it is believed that speech manner is an
important factor in the educational
development of potential literary ap-
preciations and insights.


REFERENCES

Birkhoff, G. D. Aesthetic measure. Cam-
bridge: Harvard Univer. Press, 1933.

Good, C. V., Barr, A. S., & Scates, D. E. The
methodology of educational research.
New York: Appleton-Century, 1938.

Langfeld, H. Aesthetic attitude. New York:
Harcourt, Brace, 1920.

Parker, D. H. The principles of aesthetics.
New York: Silver Burdett, 1920.

Pratt, C. C. Aesthetics. Annu. Rev. Psychol.,
1961, 12, 71-92.

Rhodes, M. An analysis of creativity. Phi
Delta Kappan, 1961, 42(7), 305-310.

Siegel, S. Nonparametric statistics for the
behavioral sciences. New York: Mc-
Graw-Hill, 1956.

Speer, R. Measurement of appreciation in
poetry, prose and art and studies in ap-
preciation. Teachers Coll. Contrib.
Educ., 1929, No. 362.

Thurstone, L. L. The measurement of val-
ues. Chicago: Univer. Chicago Press,
1959.
(Received June 21, 1961) 


Journal of Educational Psychology 
1961, Vol. 52, No. 6, 330-334 


MEASURING THE VALUE SYSTEMS OF 
EDUCATION PROFESSORS¹


DONALD J. VELDMAN, ROBERT F. PECK, AND CARSON McGUIRE


University of Texas 


The basic idea of this study can be 
applied to any situation where two or 
more judges make quantified assess- 
ments of people on a series of char- 
acteristics. Most factorial studies an- 
alyze the relationships between scales 
(R technique) or between subjects (P 
technique) after averaging or combin- 
ing the scores given by different 
judges. The point of the present study, 
however, is this: If each judge’s ratings
on the total population are separately 
factor analyzed, it becomes possible to 
identify his particular way of con- 
struing the characteristics under study. 
Furthermore, his assumptions (often 
unconscious) about the way he thinks 
the characteristics relate to one an- 
other are directly shown by the factors 
that emerge from his ratings. The 
present method resembles Kelly’s 
(1955) “personal construct” approach, 
but is applicable to any set of judg- 
ments, regardless of their specific con- 
tent. 

If, upon comparison, all or most of
the judges turn out the same factor
structure, they may be said to show
high reliability. Moreover, such agree-
ment can be regarded as evidence of
construct validity for the dimensions
of human nature on which they all
agree. It must be remembered, how-
ever, that agreement does not neces-
sarily prove the common set of con-
ceptions to be valid. All it reflects is a
common frame of reference among the
judges. Validation against external ev-
idence does continue to have a persua-
sive desirability of its own.

¹This research is a part of the Mental
Health in Teacher Education Project, Col-
lege of Education, University of Texas. This
project is supported by Grant No. 2M-6635
from the Training Branch, National Insti-
tute of Mental Health.

A very interesting possibility arises,
however, when disagreement is un-
covered. When comparison of the sev-
eral factor-systems reveals significant
differences between the individual
judges, it is possible to interpret each
judge’s “unique” factors as a mere
projection of his own special, some-
what private conceptual world. In-
deed, it is possible to identify precisely
how many of his concepts he holds in
common with his colleagues, and pre-
cisely how much he differs, where he
does differ. Moreover, the pattern of
relationships the judge expects to ob-
tain among the different aspects of
human nature is clearly specified.
Whether any given judge is “right” or
not, is a subject for further determina-
tion; but what each judge currently
believes to be true is made clear.

When the measured characteristics
include many aspects of personality
which presumably are related to
teaching effectiveness, and the judges
are professors of education, [illegible]
prospective teachers, the findings take
on considerable practical significance
(Peck, 1960). At least, what professors
believe to be true about human nature
is apt to influence their perception or
evaluation of their students. What
they believe is also likely to color their
teaching and their advice to school
executives.

Finally, when the measures include
an estimate of the “overall mental
health” of each subject, it becomes
possible to determine exactly what
“mental health” means to each judge
in terms of all the behavioral char- 
acteristics included in the study. 




PROCEDURE 


Rater Constructs. Four judges experi- 
enced in personality assessment (two profes- 
sors, two research associates) independently 
rated 53 female sophomores in education. 
Fifty aspects of personality and behavior 
were rated on seven-point scales. A rating 
on “overall mental health” was included. 

The appraisals were based on the follow-
ing battery:

(1) Biographical Information Form
(2) Peck Sentence Completion, 2-D (90 items)
(3) Group Thematic Apperception Test (8 pictures)
(4) Bown School Situations Analyzer
(5) Educational Issues Questionnaire

Following completion of the assessments,
the ratings were transferred to IBM cards
and all subsequent computations were car-
ried out with an IBM 650 data processing
system.

The 50 rating scales were intercorrelated 
for each rater separately. The four resulting 
intercorrelation matrices were separately 
subjected to centroid factor analysis and the
resulting structures were rotated by means
of the normalized varimax method which
maintains orthogonality among the axes.

The factors extracted in each analysis are
described in Table 1, which also includes the
percentage of common variance accounted
for by each factor, and the loading of the
overall mental health rating on each factor.

In order to obtain an objective estimate 
of the degree of similarity among the vari- 
ous rater factors, the rotated factor loadings 
were used as data for an intercorrelation
analysis of the 25 factor variables. Table 2 
indicates the primary interrater factor rela-
tionships.
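The comparison of rater factors can be sketched as follows. This is an illustration under stated assumptions, not the authors' program: each rater's rotated loadings are taken to be a scales-by-factors array, the toy loadings are invented, and `interrater_factor_matches` is a hypothetical helper. It correlates every factor's column of loadings with every other factor's column and keeps, for each factor, the highest coefficient found among the other raters' factors.

```python
import numpy as np

def interrater_factor_matches(loadings_by_rater):
    """Intercorrelate all factor-loading columns; find each factor's best
    match among the factors of the OTHER raters (signed correlation)."""
    labels, columns = [], []
    for rater, L in loadings_by_rater.items():
        for j in range(L.shape[1]):
            labels.append((rater, j + 1))           # e.g. ("A", 1) for Factor A-I
            columns.append(L[:, j])
    C = np.corrcoef(np.column_stack(columns), rowvar=False)
    matches = {}
    for i, (rater, _) in enumerate(labels):
        others = [j for j, (r, _) in enumerate(labels) if r != rater]
        best = max(others, key=lambda j: C[i, j])
        matches[labels[i]] = (labels[best], C[i, best])
    return C, matches

# Toy example (invented numbers): two raters, six scales, two factors each.
base = np.array([[.9, .1], [.8, .0], [.7, .2], [.1, .9], [.0, .8], [.2, .7]])
noise = 0.05 * np.array([[1, -1], [-1, 1], [1, 1], [-1, -1], [1, -1], [-1, 1]])
toy = {"A": base, "B": base[:, ::-1] + noise}       # B has the factors in swapped order
C, matches = interrater_factor_matches(toy)
```

With the study's four raters the stacked matrix would have 25 columns (5 + 7 + 7 + 6 factors), and `C` would be the 25 x 25 matrix that Table 2 condenses. Whether to compare signed or absolute coefficients is a design choice; signed values are used here because the table reports the highest coefficients.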

Defining Mental Health. In Table 3 the
major personality correlates of mental
health are listed. These constitute the com-
mon area of agreement about the meaning of
“mental health.” Table 4 shows the person-
ality variables on whose relationships to 
mental health the four judges differed. 


RESULTS 


Rater Constructs 
It is apparent from Table 1 that
the raters employ construct systems
which differ considerably in degree of
differentiation. (The percentages of
common variance are a more reliable
index of meaningful differentiation




TABLE 1

Factors Emerging in Each Analysis,
the Common Variance Associated
with Each, and Their Mental
Health Significance

                                                          Percentage   Loading of
                                                          of common    mental health
Factor   Factor description                               variance     rating

Rater A
  I      Creative, perceptive, independent                   29          .5?
  II     Emotionally and judgmentally restrained             17          .43
  III    Optimistic, relaxed, peer oriented, sociable        24          .59
  IV     Child oriented, affectionate, indecisive            10          .12
  V      Sexually integrated                                  2          .18

Rater B
  I      Intelligent, perceptive, independent, creative      30          .67
  II     Judgmentally and emotionally restrained             15          .29
  III    Sociable, peer oriented                              9          .37
  IV     Child oriented, affectionate, cheerful              11          .40
  V      Self-confident, hopeful, relaxed                     5          .27
  VI     Nondogmatic, flexible                                3          .13
  VII    Conventional, stable, expedient                      2          .15

Rater C
  I      Independent, dominant, nonconforming                33          .74
  II     Judgmentally and emotionally restrained             13          .41
  III    Affectionate, cooperative, peer oriented            19          .84
  IV     Mild emotional reactivity                            3          .02
  V      Imaginative, insightful                              2         -.05
  VI     Goal oriented, emotionally constricted               3          .01
  VII    Hopeful, sociable, rigid, “adult”                    2          .04

Rater D
  I      Creative, independent, perceptive                   27          .63
  II     Stable, emotionally and judgmentally restrained     16          .35
  III    Sociable, cooperative, affectionate, cheerful,      20          .54
         peer oriented
  IV     Persistent, proud, “adult”                           5         -.02
  V      Relaxed, self-confident                              3          .08
  VI     Rational, sexually integrated, goal oriented         4          .22


than is the number of factors extracted 
in each analysis.) 

Table 2 reveals the presence of three
dimensions common to the construct 
systems of all four raters: 




TABLE 2
Primary Interrater Factor
Relationships

Factor   Rater A      Rater B   Rater C   Rater D
I        B1 97        A1 97     B1 94     C1 94
II       B2 92        A2 92     A2 90     A2 92
III      C3, D3 86    A3 86     D3 91     C3 91
IV       B4 69        C3 87     D2 42     C7 55
V        C? 47        A3 78     B? 43     B5 70
VI       -            C3 66     D? 33     C1 57
VII      -            D? 37     D4 55     -


Note.—This table is a condensation of a square ma- 
trix of factor loading intercorrelations, and includes only 
the highest coefficients in each column of the matrix. 
Decimal points have been omitted. 


Common Dimension I: Creatively 
Intelligent Autonomy. Factor I ac- 
counted for the most variance in all 
analyses. Comparisons of the descrip- 
tions in Table 1 reveal only minor dif- 
ferences in emphasis among the four 
raters, except for the absence of intel- 
lectual attributes in Rater C’s cluster. 

Common Dimension II: Emotional- 
Judgmental Restraint. The key traits 
here are virtually identical for all 
raters, although Rater D placed some- 
what more weight on stability of mood. 

D. J. VELDMAN, R. F. PECK, AND C. McGUIRE

Common Dimension III: Peer Ori-
ented Sociability. All four raters agree
on a core cluster of traits, with wide 
variations in the other variables they 
also include in this dimension. Rater 
B appears to define the dimension 
most narrowly, while Rater D allows 
a large number of traits nearly equal 
weight. 

Unique Dimensions. Beginning with 
the fourth factor in each analysis, the 
dimensions become more idiosyncratic. 
In some cases it is possible to identify 
a separate dimension for one rater as 
part of a broader dimension in another 
rater’s system. Considering the coeffi-
cients in Table 2, on the other hand, it
would be difficult to regard factors A5,
B7, C4, C5, C6, D4, or D6 as any-
thing but idiosyncratic.

Factors A4 and B4 are quite similar
and seem to represent as a separate 
cluster the characteristics of affection 
and positive orientation toward chil- 
dren which Raters C and D include in 
their third common factor. Factors B5
and D5 appear to be a good deal alike:
They separate off one facet of Rater 
A’s third factor—relaxation—which 
only he includes in Factor III. Factor
B6, although somewhat unique to this
rater, represents “flexibility,” a major


TABLE 3
Primary Correlates of Mental Health, Averaged over Four Raters

Average correlation
with mental health    Bipolar scale description
82                    Rational-altruistic character vs. Nonrational-altruistic character
82                    [illegible] vs. Distorted, biased perception
81                    [illegible], objective judgment vs. [illegible] judgment
[illegible]           [illegible] vs. [illegible] thinking
[illegible]           Basic trust vs. Distrust of people and life
76                    Full reality awareness vs. Constricted undifferentiated perception
75                    Self-insightful vs. Little self-insight
74                    Appropriate goals vs. Inappropriate goals
73                    High self-esteem and self-confidence vs. Low self-[illegible]
73                    Appropriate emotional expression vs. Inappropriate [illegible]
72                    Relaxed, comfortable vs. [illegible], many specific fears
71                    Positive regard for family vs. Negative attitude [illegible]




TABLE 4
Scales of Uncertain Significance for Mental Health

Correlation with mental
health scale*
  A     B     C     D     Bipolar scale description
  28    56    48    10    Rich, complex fantasy life vs. Dull, unimaginative
  54    77    54    18    Decisive, definite judgment vs. Ambivalent, tentative judgment
  38    58    09    41    Affectionate, kind vs. Undemonstrative of affection
  74    53    67    82    Genuine interest in others vs. Self-centered
  53    60    62    17    Dominant, ascendant, competitive vs. Submissive, following, yielding
  83    81    59    73    Appropriate goals vs. Inappropriate goals
  58    74    80    47    Persistent, tenacious vs. Quitting, gives up easily
  1?    54    03   -13    Strong identification with children vs. Little identification with children

* Highest and lowest coefficients differ significantly at the .01 level.


aspect of the third dimension for 
Raters C and A. 

Although the idiosyncratic dimen-
sions which are unique to single raters
will not be discussed here, such dimen-
sions are of special value in identifying
conceptions which the individual does 
not share with anyone else. This in- 
formation could be quite useful, for 
instance, in training assessment spe- 
cialists to identify and correct concep- 
tual sets which are mainly projections 
and which may distort their judgment. 


Mental Health Definition


Table 3 shows the way the term 
“mental health” was actually used. It 
appears that “rational-altruistic char-
acter” (Peck & Havighurst, 1960) is
equivalent to “mentally healthy” for
these judges. Of paramount signifi-
cance are such traits as objectivity,
rationality, insightfulness, and basic
trust in other people. Self-confidence 
and appropriateness of behavior also 
are important components. 

Table 4, on the other hand, reveals
certain discrepancies among the raters’
viewpoints. Raters A and D place less
importance on richness of fantasy and
on persistence than do Raters B and
C. Rater B de-emphasizes interest in 
others, but stresses identification with 
children, as compared to the other 
raters. Rater C places less importance
on expression of affection and on ap- 
propriateness of goals. Rater D does 
not place importance on decisiveness 
of judgment, or dominance, to the ex- 
tent that the others do. It is nonethe- 
less interesting to note that only one 
minor reversal of direction occurred in 
the correlation coefficients. 
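Table 4 reports that the highest and lowest coefficients in each row differ significantly at the .01 level, but the test used is not named. One conventional check is Fisher's r-to-z comparison of two correlations, sketched below. It is offered only as an approximation: it treats the two correlations as coming from independent samples, whereas here all four raters judged the same 53 subjects, so the source's actual procedure may well have differed. The function name `fisher_z_difference` is hypothetical.

```python
import math

def fisher_z_difference(r1, r2, n1, n2):
    """Two-tailed test of the difference between two correlations via
    Fisher's r-to-z transformation (assumes independent samples)."""
    z1, z2 = math.atanh(r1), math.atanh(r2)          # r-to-z transform
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))  # standard error of z1 - z2
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))           # two-tailed normal p-value
    return z, p

# First row of Table 4: highest (Rater B, .56) vs. lowest (Rater D, .10), N = 53.
z, p = fisher_z_difference(0.56, 0.10, 53, 53)
```

For this row the statistic exceeds the two-tailed .01 critical value of 2.576, in line with the table's footnote, although the agreement should not be read as confirming which test was used.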


DISCUSSION


Even where judges largely agree, as 
these did (the reliability of their aver- 
aged ratings was .85), the construct 
system of each judge still shows cer- 
tain unique aspects. The method of 
analyzing ratings which was used here 
reveals each judge’s system of beliefs 
about the structure of human nature. 

If, in some other study, judges were 
to disagree notably, such a method of 
analysis would identify even more 
sharply each judge’s unique set of con- 
cepts. As a study in the psychological 
beliefs of people like professors, this 
might be interesting in itself. Beyond
this, however, clear, precise identifica-
tion of the specific points at which dif- 
ferences exist could conceivably lead 
to clearer, more conscious, and more 
rational discussion among judges, until 
consensus was reached or they agreed to
disagree. In either case, this kind of 
analysis should prove a healthy re- 
minder that the arguments are not 
necessarily about the facts of human 
nature, but may sometimes arise from 
differing individual construct systems. 
The construct systems, in this case, are 
laid out on the table, for all to see. 
Such a procedure, needless to say, is 
not limited to studies of personality. It 
could as usefully be applied to studies 
of ethical values, beliefs about edu- 
cational practices—indeed, any issue 
important enough to warrant some 
planned scaling of judgments, and a 
moderate amount of computer time. 


SUMMARY 


A generalized method was developed 
for identifying the individual construct 
systems of judges of human behavior. 
The method is that of separately fac- 
tor analyzing each judge’s ratings on 
whatever scales are used in an assess- 
ment of people. Thereafter, correlation 
of the factors generated by several 
judges indicates the nature and degree 
of similarities and differences in the 
way the different judges define “hu-
man nature,” and the way in which
they believe different aspects of be- 
havior are related to one another. 

In the present instance, four educa-
tional psychologists rated 53 sophomore
coeds on 50 personality variables
and on a scale of “overall mental
health.” The ratings were based on 
qualitative analysis of a battery in- 
cluding a biographical form, the Peck 
Sentence Completion (2-D), group 
TAT, Bown School Situations Ana- 
lyzer, and an Educational Issues Ques- 
tionnaire. The average interjudge cor- 
relation for all ratings was .85. Even
so, some significant differences were
found in the construct systems of the 
four judges. Each showed certain 
unique concepts about the structure of 
personality. Each differed, at certain
points, about the personality corre-
lates of mental health, although there 
was agreement among all the judges 
that 12 personality variables are 
closely related to mental health. 

The general logic of this procedure 
can be applied to any kind of judg- 
ments, regardless of the specific vari-
ables involved. It thus offers a gener-
ally applicable way of measuring
individual construct systems, it ap-
pears, in any field where human beings
make judgments. To differentiate it 
from R, P, and Q techniques, perhaps 
it might be called “J technique.” 


REFERENCES 


Kelly, G. A. The psychology of personal constructs. New York: Norton, 1955. 2 vols.
Peck, R. F. Personality patterns of prospective teachers. J. exp. Educ., 1960, 29, 169-175.
Peck, R. F., & Havighurst, R. J. The psychology of character development. New York: Wiley, 1960.


(Received June 24, 1961) 






